Task 2: Explore dataset
Let's explore the dataset to understand each column.
- In the DATASETS page, observe the two datasets we will use for this tutorial.
- Click the AmazonFineFoodReviews_train... dataset and select the DETAILS option.
- Let’s take a quick overview of the columns of the training dataset:
The dataset consists of 11 columns which are as follows:
a. UserId - unique identifier for the user
b. ProductId - unique identifier for the product
c. Id - review ID
d. Summary - brief summary of the review
e. Score - rating between 1 and 5
f. HelpfulnessDenominator - number of users who indicated whether they found the review helpful or not
g. ProfileName - profile name of the user
Continue scrolling to the right of the page to view the rest of the columns of the dataset:
h. HelpfulnessNumerator - number of users who found the review helpful
i. Time - timestamp for the review
j. Description - text of the review
k. Positive Review - whether the review is Positive or Negative
- Return to the DATASETS page.
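If you also have the training data as a local CSV file, the same column overview can be cross-checked outside the UI. The following is a minimal sketch, not part of Driverless AI: the helper names `check_columns` and `helpfulness_ratio` are hypothetical, the two sample rows are made up for illustration, and the header spellings are assumed to match the column names listed above (the actual file may spell them differently, for example without the space in "Positive Review").

```python
import csv
import io

# Column names as listed in this task (assumed to match the CSV header).
EXPECTED_COLUMNS = [
    "UserId", "ProductId", "Id", "Summary", "Score",
    "HelpfulnessDenominator", "ProfileName", "HelpfulnessNumerator",
    "Time", "Description", "Positive Review",
]


def check_columns(csv_file):
    """Return (missing, extra) column names relative to EXPECTED_COLUMNS."""
    header = csv.DictReader(csv_file).fieldnames or []
    missing = [name for name in EXPECTED_COLUMNS if name not in header]
    extra = [name for name in header if name not in EXPECTED_COLUMNS]
    return missing, extra


def helpfulness_ratio(row):
    """Share of voters who found the review helpful, or None if nobody voted."""
    denominator = int(row["HelpfulnessDenominator"])
    if denominator == 0:
        return None
    return int(row["HelpfulnessNumerator"]) / denominator


# Hypothetical rows for illustration only; they are not taken from the dataset.
SAMPLE_CSV = (
    "UserId,ProductId,Id,Summary,Score,HelpfulnessDenominator,ProfileName,"
    "HelpfulnessNumerator,Time,Description,Positive Review\n"
    "U1,P1,1,Tasty,5,4,alice,3,1300000000,Great snack,Positive\n"
    "U2,P2,2,Stale,1,0,bob,0,1300000001,Arrived stale,Negative\n"
)

if __name__ == "__main__":
    missing, extra = check_columns(io.StringIO(SAMPLE_CSV))
    print("missing columns:", missing)
    print("unexpected columns:", extra)
    for row in csv.DictReader(io.StringIO(SAMPLE_CSV)):
        print(row["Id"], row["Score"], helpfulness_ratio(row))
```

To run the check against a real file, pass an open file handle (for example `open(path, newline="")`) to `check_columns` instead of the in-memory sample. The ratio helper illustrates how HelpfulnessNumerator and HelpfulnessDenominator relate: the first counts users who found the review helpful, the second counts all users who voted either way.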