View a dataset's summary
A dataset summary lets you quickly understand an array of insights about a particular dataset in your established Driverless AI (DAI) connection (e.g., count, mean, STD, min, max, missing, etc.).
Instructions
To access a dataset summary, consider the following instructions:
- In the H2O Model Validation navigation menu, click Datasets. note
- On the Datasets card, you can search for your dataset summaries on the datasets table.
- On the Datasets card, in particular, in the datasets table, you can view a dataset summary for all your datasets. To learn more, see Datasets table columns
- In the datasets table, select the dataset summary you want to view.
- Click View.note
A dataset summary table will appear, highlighting several summary metrics about the dataset (e.g., frequency). To learn more, see Dataset summary table.
Datasets table columns
Column name | Description |
---|---|
Name | Dataset name. |
Data Summary | State of the data summary. |
Rows | Row numbers. |
Columns | Column numbers. |
File Size | File size. |
Adversarial Similarity | The number of Adversarial Similarity tests complete and scheduled to run. |
Drift Detection | The number of Drift Detection tests complete and scheduled to run. |
Dataset summary states
As follows are the different types of states a dataset summary can be in:
- NotCreated
- H2O Model Validation has not created the dataset summary.
- Created
- H2O Model Validation created the dataset summary.
- Running
- H2O Model Validation is currently running the dataset summary.
- Done
- H2O Model Validation has completed the dataset summary.
- Deleted
- H2O Model Validation deleted the dataset summary.
- Error
- An error occurred during the dataset summary.
- Timeout
- There was not enough time to complete the dataset summary.
Dataset summary table
Column name | Description |
---|---|
Feature | Feature name (one of the column names in the dataset). |
Count | Count (number) of value features present in the feature column. |
Mean | The typical feature value. |
Std | Feature values standard deviation (a measure of divergence or distribution of the feature values). |
Min | The minimum feature value. |
Max | The maximum feature value. |
Missing | Missing feature values. |
Unique | Unique feature values. |
Freq | The feature frequency value. |
note
H2O Model Validation will mark feature columns with N/A (not applicable) if the column feature value is non-numeric.
Feedback
- Submit and view feedback for this page
- Send feedback about H2O Model Validation to cloud-feedback@h2o.ai