Skip to main content
Version: v0.16.0

Metrics: Backtesting

H2O Model Validation offers an array of metrics to understand a Backtesting test. Below, each metric is described in turn.

Graph: Backtesting {metric}

The graph displays the dynamics of the backtested model's scorer value through time. In this case, {metric} refers to the model's scorer. In addition, you can use the graph to see the dynamics of the model accuracy while discovering if accuracy depends on time. You can also use the graph to investigate past environmental changes during data collection that led to drops in model performance.

  • Y-axis: Model scorer (Backtesting {metric})
  • X-axis: Date (Backtesting splits)
  • Cross-validation: Cross-validation metric values calcualted on train datasets
  • Back-test: Backtesting metric values calculated on test datasets

backtesting-metric.png

Graph: Target distributions

The graph displays the target distribution of the backtesting train and test dataset splits. You can use this graph to investigate model accuracy drops in the past due to a change in the target variable over time.

  • Y-axis: Target column values
  • X-axis: Split dates
  • Train: Target distribution values of the backtesting training dataset
  • Back-test: Target distribution values of the backtesting test dataset

target-distribution.png

Heatmap: Feature importance for different split dates

The heatmap visualizes the most important features of the backtested models. The heatmap is helpful when investigating how variable importance evolves.

  • Rows: Raw input variables
  • Columns: Backtesting splits
  • Heatmap values: Feature importance scorers

feature-importance-for-different-split-dates.png

Table: Models

The table displays the Driverless AI experiments corresponding to backtesting models at each split.

Column nameDescription
#Experiment number
EnsembleEnsemble of models used in the Driverless AI experiment with their weights
Best modelBest machine learning (ML) model used in the Driverless AI experiment
Train dataset time spanStart and end date of the training data used in the Driverless AI experiment
Train dataset sizeSize of the training data used in the DriverlessAI experiment
Test dataset time spanStart and end date of the test data used in the Driverless AI experiment
Test dataset sizeSize of the test data used in the Driverless AI experiment
Best featureBest feature information of the Driverless AI experiment

Feedback