Skip to main content

Task 2: Explore dataset

Let's explore the dataset to understand each column.

  1. In the DATASETS page, observe the two datasets we will use for this tutorial.
  2. Click the AmazonFineFoodReviews_train... dataset and select the DETAILS option.
  3. Let’s take a quick overview of the columns of the training dataset:
  • The dataset consists of 11 columns which are as follows: dataset-columns-info-1 a. UserId - unique identifier for the user
    b. ProductId - unique identifier for the product
    c. Id - review ID
    d. Summary - brief summary of the review
    e. Score - rating between 1 and 5
    f. HelpfulnessDenominator - number of users who indicated whether they found the review helpful or not
    g. ProfileName - profile name of the user

  • Continue scrolling to the right of the page to view the rest of the columns of the dataset: dataset-columns-info-2 h. HelpfulnessNumerator - number of users who found the review helpful
    i. Time - timestamp for the review
    j. Description - text of the review
    k. Positive Review - whether the review is Positive or Negative

  1. Return to the Datasets page.

Feedback