Skip to main content
Version: v0.2.0

Tutorial 1B: Annotation task: Image classification

This tutorial will underline the steps (process) of annotating and specifying an annotation task rubric for an image classification annotation task. This tutorial will also quickly explore how you can download the fully annotated dataset supported in H2O Hydrogen Torch.

Step 1: Explore dataset

We will use the preloaded car-or-coffee-demo demo dataset for this tutorial. The dataset contains 40 images, each depicting a car or coffee. Let's quickly explore the dataset.

  1. On the Label Genie navigation menu, click Datasets.
  2. In the datasets table, click car-or-coffee-demo.

car-or-coffee-sample

Step 2: Create annotation task

Now that we have seen the dataset let's create an annotation task that will enable you to annotate the dataset. An annotation task refers to the process of labeling data. For this tutorial, an image classification annotation task refers to assigning one or more categorical target labels to an input image. Let's create an annotation task.

  1. Click New annotation task.
  2. In the Task name box, enter Tutorial 1B.
  3. In the Task description box, enter Annotate dataset containing images of cars and coffee.
  4. In the Select task list, select Classification.
  5. Click Create task.

Step 3: Specify annotation task rubric

Before we can start annotating our dataset, we need to specify an annotation task rubric. An annotation task rubric refers to the labels (e.g., object classes) you want to use when annotating your dataset. For our dataset, there are two categorical target labels we want to specify, car and coffee. Let's define the annotation task rubric.

  1. In the Class name box, enter Car.
  2. Click Add class.
  3. In the new Class name box, enter Coffee.

Classes

Step 4: Annotate dataset

Now that we have specified the annotation task rubric, let's annotate the dataset.

  1. Click Continue to annotate.

In the Annotate tab, you can individually annotate each image in the dataset. Let's annotate the first image. In this case:

  1. Select Car or Coffee.
    Note
    • As you annotate the dataset, you can select which label to use depending on whether the image depicts a car or coffee. To select the Coffee or Car label: Click Coffee or Car.
    • A zero-shot learning model is on by default when you annotate an image classification annotation task. The model accelerates the annotation (labeling) process by providing the percentage probability of an image (in this case, a car or coffee image) belonging to a certain label (one of the labels created in the Rubric tab).
  2. Click Save and next.
    Note
    • Save and next saves the annotated image
    • To skip an image to annotate later: Click Skip.
      • Skipped images (samples) will reappear after all non-skipped images are annotated

Export annotated dataset

After annotating all the images, you can download the dataset in a format that H2O Hydrogen Torch supports. Let's download the annotated dataset.

  1. In the Annotate tab, click Export annotated samples.

    Export annotated samples

    Note
    • In the Dashboard tab, you can download all annotated images at any point in the annotation process of a dataset. To download all annotated images so far, consider the following instructions:
      1. Click the Dashboard tab.
      2. Click Export approved samples.
    • H2O Label Genie will download a .zip file containing the annotated dataset in a format aligning with the dataset's problem type (annotation task type). To learn more, see Downloaded dataset formats: Image classification.

Summary

In this tutorial, we learned the process of annotating and specifying an annotation task rubric for an image classification annotation task. We also learned how to download a fully annotated dataset supported in H2O Hydrogen Torch.

Next

To learn the process of annotating and specifying an annotation task rubric for other various annotation tasks in computer vision (CV), natural language processing (NLP), and audio, see Tutorials.


Feedback