Metrics: Backtesting

Overview

H2O Model Validation offers an array of metrics to help you understand the results of a backtesting test. Each metric is described below.

Settings

Backtesting {metric}

The backtesting {metric} graph displays how the backtested model's scorer value changes over time. Here, {metric} refers to the model's scorer. Use the graph to see how model accuracy evolves and to discover whether accuracy depends on time. You can also use the graph to investigate past environmental changes during data collection that led to drops in model performance.

  • Y-axis: Model scorer (backtesting {metric})
  • X-axis: Date (backtesting splits)
  • Cross-validation: Cross-validation metric values calculated on train datasets
  • Back-test: Backtesting metric values calculated on test datasets

backtesting-metric.png
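
To make the idea behind the graph concrete, the following is a minimal sketch of how a scorer value can be computed per backtesting split. It is not the H2O Model Validation or Driverless AI API. The function names (`mae`, `backtest_scores`), the sample data, the mean-predictor stand-in for a model, and the rule that rows before a split date form the train set are all assumptions made for illustration.

```python
from datetime import date
from statistics import mean


def mae(actual, predicted):
    """Mean absolute error between two equal-length sequences."""
    return mean(abs(a - p) for a, p in zip(actual, predicted))


def backtest_scores(rows, split_dates):
    """Return [(split_date, test_mae)] for a mean-predictor baseline.

    rows: list of (date, target) pairs.
    Assumption: for each split date, rows before the date form the
    train set and rows on or after it form the test set.
    """
    scores = []
    for split in split_dates:
        train = [y for d, y in rows if d < split]
        test = [y for d, y in rows if d >= split]
        if not train or not test:
            continue  # skip splits that leave one side empty
        prediction = mean(train)  # hypothetical stand-in for a trained model
        scores.append((split, mae(test, [prediction] * len(test))))
    return scores


# Illustrative data only: the target jumps in April.
rows = [
    (date(2023, 1, 1), 10.0),
    (date(2023, 2, 1), 12.0),
    (date(2023, 3, 1), 11.0),
    (date(2023, 4, 1), 20.0),
    (date(2023, 5, 1), 22.0),
]
splits = [date(2023, 3, 1), date(2023, 4, 1)]
scores = backtest_scores(rows, splits)
for split, score in scores:
    print(split.isoformat(), round(score, 2))
```

Plotting the second element of each pair against its split date would give a curve comparable in spirit to the Back-test series: the error rises once the test window is dominated by the shifted April and May values.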

Target distributions (linear scale)

The target distributions (linear scale) graph displays the target distribution of the backtesting train and test dataset splits. You can use this graph to investigate whether past drops in model accuracy were caused by a change in the target variable over time.

  • Y-axis: Target column values
  • X-axis: Split dates
  • Train: Target distribution values of the backtesting training dataset
  • Back-test: Target distribution values of the backtesting test dataset

Target distribution (linear scale)
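
As a rough illustration of what comparing train and test target distributions can reveal, the sketch below summarizes both sides of one split and reports the shift in their medians. The helper names (`summarize`, `target_shift`) and the sample values are hypothetical and are not part of H2O Model Validation.

```python
from statistics import median


def summarize(values):
    """Three-number summary (min, median, max) of a list of target values."""
    return {"min": min(values), "median": median(values), "max": max(values)}


def target_shift(train_targets, test_targets):
    """Test median minus train median; a large value hints at target drift."""
    return median(test_targets) - median(train_targets)


# Illustrative target values for a single backtesting split.
train = [10.0, 12.0, 11.0]
test = [20.0, 22.0]

print("train:", summarize(train))
print("test:", summarize(test))
print("median shift:", target_shift(train, test))
```

A shift that is large relative to the spread of the training targets suggests the model was scored on values it rarely saw during training, which is one plausible explanation for an accuracy drop at that split.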

note

For regression models, you can change the target distribution graph from a linear to a log scale. To switch between scales, do the following:

  1. Click More (within the target distributions (linear scale) graph card).
    • To switch to linear scale, select Linear scale.
    • To switch to logarithmic scale, select Log scale.

Target distribution (log scale)

Feature importance for different split dates

The feature importance heatmap visualizes the most important features of the backtested models. The heatmap is helpful when investigating how variable importance evolves across backtesting splits.

  • Rows: Raw input variables
  • Columns: Backtesting splits
  • Heatmap values: Feature importance scores

feature-importance-for-different-split-dates.png

note

The heatmap does not display features that the model does not use, nor does it display the target variable.
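
The layout of the heatmap (features as rows, splits as columns) can be sketched as a small data structure. The function `importance_matrix`, the input shape, and the sample feature names are assumptions for illustration only. Treating "not utilized" as "zero importance in every split" is also an assumption of this sketch.

```python
def importance_matrix(importances_by_split, target):
    """Arrange per-split importances as rows=features, columns=splits.

    importances_by_split: {split_label: {feature: importance}}
    Features with zero importance in every split, and the target
    column, are left out of the result.
    """
    splits = list(importances_by_split)
    features = sorted({f for imp in importances_by_split.values() for f in imp})
    matrix = {}
    for feature in features:
        if feature == target:
            continue  # the target variable is never shown
        row = [importances_by_split[s].get(feature, 0.0) for s in splits]
        if any(value > 0 for value in row):
            matrix[feature] = row  # keep only features used in some split
    return splits, matrix


# Illustrative importances for two backtesting splits.
data = {
    "2023-03-01": {"price": 1.0, "promo": 0.4, "sales": 0.9, "store_id": 0.0},
    "2023-04-01": {"price": 0.7, "promo": 1.0, "store_id": 0.0},
}
columns, heatmap = importance_matrix(data, target="sales")
print(columns)
for feature, row in heatmap.items():
    print(feature, row)
```

Reading a row left to right shows how one feature's importance changes from split to split. In the sample data, `promo` overtakes `price` in the later split.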

Models tab

Models table

The models table displays the models corresponding to each split.

| Column name | Description |
|---|---|
| # | Experiment number. |
| Ensemble | Ensemble models used in the Driverless AI experiment with their weights during the backtesting test. |
| Best model | Best machine learning (ML) model used in the Driverless AI experiment for the backtesting test. |
| Train dataset time span | Start and end date of the training data used in the Driverless AI experiment. |
| Train dataset size | Size of the training data used in the Driverless AI experiment. |
| Test dataset time span | Start and end date of the test data used in the Driverless AI experiment. |
| Test dataset size | Size of the test data used in the Driverless AI experiment. |
| Best feature | Best feature information of the Driverless AI experiment. |
