Skip to main content
Version: v1.4.0

Dataset format: Text regression

The data for a text regression experiment can be formatted following format 1 or 2.

A CSV file.

csv_name.csv (1)(2)
  1. The available dataset connectors require the data for a text regression experiment to be in a zip or CSV file.
    Note

    To learn how to upload your zip or CSV file as your dataset in H2O Hydrogen Torch, see Dataset connectors.

  2. A CSV file containing the following columns:
    • A text column containing the texts for the experiment
    • One or more label columns containing the numerical labels (targets)
      Note

      H2O Hydrogen Torch can train models that predict multiple labels simultaneously. You can provide multiple columns with multiple unique labels and choose which labels to predict when starting a new experiment.

    • An optional fold column containing cross-validation fold indexes
      Note

      The fold column can include integers (0, 1, 2, … , N-1 values or 1, 2, 3… , N values) or categorical values.


Feedback