Skip to main content
Version: v0.16.0

Settings: Drift detection

H2O Model Validation offers an array of settings for a drift detection test. Below, each setting is described in turn.

Test name

Defines the name for the validation test; by default, H2O Model Validation assigns a name to the validation test that you can rewrite.

Primary dataset

Defines one of the two datasets H2O Model Validation uses during the validation test to identify changes in the distribution of variables between the primary and secondary datasets. H2O Model Validation performs drift detection using the primary and secondary datasets captured at different times to assess how data has changed over time.

note

Models: Within the context of validating a model, the defined primary dataset needs to follow the structure of the model's training dataset.

Secondary dataset

Defines one of the two datasets H2O Model Validation uses during the validation test to identify changes in the distribution of variables between the primary and secondary datasets. H2O Model Validation performs drift detection using the primary and secondary datasets captured at different times to assess how data has changed over time.

note

The defined primary dataset dictates the required format for the secondary dataset (similar columns).

Columns to drop

Defines the columns H2O Model Validation drops during the validation test. Typically drop columns refer to columns that can indicate a drift without an impact on the model, like columns not used by the model, record IDs, time columns, etc.


Feedback