Skip to main content
Version: Next

View a dataset's summary

Overview

An imported dataset contains a dataset summary that you can view anytime. A dataset summary contains an array of summary metrics, for example, count, mean, standard deviation, min, max, and missing.

Instructions

To access a dataset summary, consider the following instructions:

  1. In the H2O Model Validation navigation menu, click Datasets.
  2. In the Datasets table, click the name of the dataset you want to view its dataset summary.

Dataset summary table

note

H2O Model Validation marks feature columns with N/A (not applicable) if the column feature value is non-numeric.

Column nameDescription
FeatureFeature name (one of the column names in the dataset).
Data typeData type. For example, integer.
CountNumber (count) of value features present in the feature column.
MissingNumber of missing feature values.
Missing ratioPercentage of missing feature values (N) (N/ Number of rows).
MeanTypical feature value.
Standard deviationStandard deviation. A measure of divergence or distribution of the feature values.
MinMinimum feature value.
MaxMaximum feature value.
Number of uniqueUnique feature values.
FrequencyFeature frequency value.
Datetime formatDate string format (for example, yy-mm-dd, hh-mm-ss (hourly granularity) or dd-mm-yy).
Datetime unitThe highest unit of time. Based on incremental date changes (for example, seconds, minutes, hours, days, or weeks).

Histogram: Feature histogram

The feature histogram displays the data frequency distribution of the selected feature in the Dataset summary table.

  • X-axis: Feature values
  • Y-axis: Number of feature values

Feature histogram


Feedback