Skip to main content
Version: v0.3.0

Annotate a dataset

Overview

You need to create an annotation task to annotate (label) a dataset.

An annotation task refers to labeling data in a manner that makes them suitable to support an array of deep-learning problem types. This process can, for example, involve adding bounding boxes to images where you can attribute labels to each box.

note

Instructions

To create an annotation task, consider the following instructions:

  1. On the H2O Label Genie navigation menu, click Annotation tasks.
  2. Click New annotation task.
  3. In the Task name box, enter a name for the annotation task.
  4. In the Task description box, enter a description for the annotation task.
  5. In the Select dataset list, select a dataset (the dataset you want to label).
    • If the data type of the selected dataset is text, proceed with the following instructions:
      1. In the Select task list, select the type of annotation you want to create (for example, classification, entity recognition, etc.).
      2. In the Select text column list, select the text column in your dataset (data).
    • If the data type of the selected dataset is image or audio, proceed with the following instructions:
      1. In the Select task list, select the type of annotation you want to create (for example, classification, regression, etc.).
  6. Click Create task.
    Note

    After creating a new annotation task, specify an annotation task rubric in the Rubric tab. An annotation task rubric refers to the labels (for example, object classes) to use when annotating a dataset. For example, after creating a new annotation task for an object detection dataset, you have to specify the object classes to use when labeling the dataset in the annotation task rubric. To learn more, see Specify an annotation task rubric.


Feedback