Settings: Drift detection
H2O Model Validation offers an array of settings for a drift detection test. Below, each setting is described in turn.
Test name
Defines the name for the validation test; by default, H2O Model Validation will assign a name to the validation test that you can rewrite.
Primary Dataset
Defines one of the two datasets H2O Model Validation uses during the validation test to identify changes in the distribution of variables between the Primary and Secondary Datasets. H2O Model Validation performs drift detection using the Primary and Secondary Datasets captured at different times to assess how data has changed over time.
Models: Within the context of validating a model, the defined Primary Dataset needs to follow the structure of the model's training dataset.
Secondary Dataset
Defines one of the two datasets H2O Model Validation uses during the validation test to identify changes in the distribution of variables between the Primary and Secondary Datasets. H2O Model Validation performs drift detection using the Primary and Secondary Datasets captured at different times to assess how data has changed over time.
The defined Primary Dataset dictates the required format of the Secondary Dataset (similar columns).
Columns to drop
Defines the columns H2O Model Validation drops during the validation test. Typically drop columns refer to columns that can indicate a drift without an impact on the model, like columns not used by the model, record IDs, time columns, etc.
- Submit and view feedback for this page
- Send feedback about H2O Model Validation to cloud-feedback@h2o.ai