Skip to main content
Version: v0.16.0

Settings: Size dependency

H2O Model Validation offers an array of settings for a size dependency test. Below, each setting is described in turn.

Test name

Defines the name for the validation test; by default, H2O Model Validation assigns a name to the validation test that you can rewrite.

Model

Defines the model H2O Model Validation utilizes to run the size dependency test.

Model train dataset

note

Model train dataset refers to one of the model's informational points, not a setting. This informational point refers to the model's training dataset that H2O Model Validation utilizes during the test to analyze the effects different training data sizes have on the model's accuracy. The model's training dataset is one of the two datasets H2O Model Validation uses during the test. To learn more, see Size dependency.

Model test dataset

note

Model test dataset refers to one of the model's informational points, not a setting. This informational point refers to the model's test dataset that H2O Model Validation utilizes during the test to analyze the effects different training data sizes have on the model's accuracy. The model's test dataset is one of the two datasets H2O Model Validation uses during the test. To learn more, see Size dependency.

Time column

Defines the time column of the primary and secondary dataset, which H2O Model Validation utilizes to perform time-based splits.

note

If the selected model already used a time column during training, H2O Model Validation automatically utilizes the used time column during training as the Time column.

Number of splits

Defines the number of splits that H2O Model Validation performs on the primary dataset to assess dataset size dependency.

note

The more splits, the better; it maximizes the test's explanatory power. Each split leads to an extra experiment run. This setting affects the runtime.

Delete test models and datasets from the Worker after finish

Determines if H2O Model Validation should delete the artifacts created in the Platform connected to the Worker Connection. In this case, artifacts refer to experiments and datasets generated during the backtesting validation test. By default, H2O Model Validation checks this setting (enables it), and accordingly, H2O Model Validation deletes all artifacts because they are no longer needed after the validation test is complete.


Feedback