Model Performance on Another Dataset

The Diagnose Model on New Dataset option lets you view model performance for multiple scorers based on existing model and dataset.

On the completed experiment page, click the Diagnose Model on New Dataset button.

Note: You can also diagnose a model by selecting Diagnostics from the top menu, then selecting an experiment and test dataset.

Completed experiment menu

Select a dataset to use when diagnosing this experiment. Note that the dataset must include the target column that is in the original dataset. At this point, Driverless AI will begin calculating all available scores for the experiment.

When the diagnosis is complete, it will be available on the Model Diagnostics page. Click on the new diagnosis. From this page, you can download predictions. You can also view scores and metric plots. The plots are interactive. Click a graph to enlarge. In the enlarged view, you can hover over the graph to view details for a specific point. You can also download the graph in the enlarged view.

Classification Metric Plots

Classification metric plots include the following graphs:

  • ROC Curve

  • Precision-Recall Curve

  • Cumulative Gains

  • Lift Chart

  • Kolmogorov-Smirnov Chart

  • Confusion Matrix

Diagnosing a model - classification

Note: In the Confusion Matrix graph, the default threshold value is determined based on the threshold value that maximizes the F1 metric on the diagnostic data. For binary classification experiments, users can specify a different threshold value. The threshold selector is available after clicking on the Confusion Matrix and opening the enlarged view. When you specify a value or change the slider value, Driverless AI automatically computes a diagnostic Confusion Matrix for that given threshold value.

Threshold Picker

Regression Metric Plots

Regression metric plots include the following graphs:

  • Actual vs Predicted

  • Residual Plot with LOESS curve

  • Residual Histogram

Diagnosing a model - regression