View a dataset's summary
Overview
An imported dataset includes a dataset summary that you can view at any time. The summary contains a set of summary metrics, such as count, mean, standard deviation, min, max, and number of missing values.
Instructions
To view a dataset summary, follow these steps:
- In the H2O Model Validation navigation menu, click Datasets.
- In the datasets table, click the name of the dataset whose summary you want to view.
note
- In particular, viewing a dataset summary displays the following:
Dataset summary table
note
H2O Model Validation marks a metric as N/A (not applicable) when the values in the feature column are non-numeric.
Column name | Description |
---|---|
Feature | Feature name (one of the column names in the dataset) |
Data type | Data type (for example, Int) |
Count | Number of feature values present (non-missing) in the feature column |
Missing | Number of missing feature values |
Missing ratio | Percentage of missing feature values: N / number of rows, where N is the number of missing feature values |
Mean | The arithmetic mean (average) of the feature values |
Standard deviation | The standard deviation (a measure of how widely the feature values are spread around the mean) |
Min | The minimum feature value |
Max | The maximum feature value |
Number of unique | Number of unique feature values |
Frequency | The feature frequency value |
Datetime format | The detected date string format (for example, yy-mm-dd, hh-mm-ss (hourly granularity) or dd-mm-yy) |
Datetime unit | The highest unit of time, based on incremental date changes (for example, seconds, minutes, hours, days, or weeks) |
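
To make the numeric columns of the table concrete, the following is a minimal sketch of how such metrics could be computed for a single feature column. It is not the H2O Model Validation implementation: the function name `summarize_column`, the use of `None` for missing values, the sample (n - 1) standard deviation, and the missing ratio expressed as a fraction (multiply by 100 for a percentage) are all assumptions made for illustration.

```python
import statistics


def summarize_column(values):
    """Compute summary metrics for one feature column (None marks a missing value).

    Hypothetical helper for illustration only; not part of H2O Model Validation.
    """
    rows = len(values)
    present = [v for v in values if v is not None]
    missing = rows - len(present)
    # bool is excluded because it is a subclass of int in Python
    numeric = bool(present) and all(
        isinstance(v, (int, float)) and not isinstance(v, bool) for v in present
    )
    summary = {
        "count": len(present),
        "missing": missing,
        # assumption: ratio as a fraction of all rows
        "missing_ratio": missing / rows if rows else 0.0,
        "unique": len(set(present)),
    }
    if numeric:
        summary["mean"] = statistics.fmean(present)
        # assumption: sample standard deviation; needs at least two values
        summary["std"] = statistics.stdev(present) if len(present) > 1 else 0.0
        summary["min"] = min(present)
        summary["max"] = max(present)
    else:
        # mirrors the note above: numeric metrics are not applicable
        for key in ("mean", "std", "min", "max"):
            summary[key] = "N/A"
    return summary


age = summarize_column([20, 30, None, 40])
city = summarize_column(["Paris", None, "Oslo", "Paris"])
```

In this sketch, `age` has a count of 3, one missing value, and a missing ratio of 0.25, while `city` reports N/A for the mean, standard deviation, min, and max because its values are non-numeric.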
Histogram: Feature histogram
The Feature histogram displays the data frequency distribution of the selected feature in the Dataset summary table.
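
As a rough illustration of what a frequency distribution means here, the sketch below counts the values of a numeric feature in equal-width bins. The function name `histogram`, the equal-width binning, and the number of bins are assumptions; the source does not describe how H2O Model Validation chooses its bins.

```python
def histogram(values, bins=5):
    """Count values in `bins` equal-width intervals spanning [min, max].

    Hypothetical helper for illustration only.
    """
    lo, hi = min(values), max(values)
    width = (hi - lo) / bins or 1.0  # avoid zero width when all values are equal
    counts = [0] * bins
    for v in values:
        # the maximum value falls in the last bin rather than a bin of its own
        index = min(int((v - lo) / width), bins - 1)
        counts[index] += 1
    edges = [lo + i * width for i in range(bins + 1)]
    return edges, counts


edges, counts = histogram([1, 2, 2, 3, 3, 3, 4, 4, 5], bins=4)
```

Each entry in `counts` is the height of one bar, and `edges` holds the boundaries between bars.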