Settings: Size dependency
H2O Model Validation offers an array of settings for a size dependency test. Below, each setting is described in turn.
Test name
Defines the name for the validation test; by default, H2O Model Validation assigns a name to the validation test that you can rewrite.
Model
Defines the model H2O Model Validation utilizes to run the size dependency test.
Model train dataset
Model train dataset refers to one of the model's informational points, not a setting. This informational point refers to the model's training dataset that H2O Model Validation utilizes during the test to analyze the effects different training data sizes have on the model's accuracy. The model's training dataset is one of the two datasets H2O Model Validation uses during the test. To learn more, see Size dependency.
Model test dataset
Model test dataset refers to one of the model's informational points, not a setting. This informational point refers to the model's test dataset that H2O Model Validation utilizes during the test to analyze the effects different training data sizes have on the model's accuracy. The model's test dataset is one of the two datasets H2O Model Validation uses during the test. To learn more, see Size dependency.
Time column
Defines the time column of the primary and secondary dataset, which H2O Model Validation utilizes to perform time-based splits.
If the selected model already used a time column during training, H2O Model Validation automatically utilizes the used time column during training as the Time column.
Number of splits
Defines the number of splits that H2O Model Validation performs on the primary dataset to assess dataset size dependency.
The more splits, the better; it maximizes the test's explanatory power. Each split leads to an extra experiment run. This setting affects the runtime.
Delete test models and datasets from the Worker after finish
Determines if H2O Model Validation should delete the artifacts created in the Platform connected to the Worker Connection. In this case, artifacts refer to experiments and datasets generated during the backtesting validation test. By default, H2O Model Validation checks this setting (enables it), and accordingly, H2O Model Validation deletes all artifacts because they are no longer needed after the validation test is complete.
- Submit and view feedback for this page
- Send feedback about H2O Model Validation to cloud-feedback@h2o.ai