Skip to main content
Version: v0.4.0

Tutorial 1C: Audio classification annotation task

Overview

This tutorial describes the process of creating an audio classification annotation task, including specifying an annotation task rubric for it. To highlight the process, we will annotate a dataset containing 5-second-long recordings of environmental sounds organized into ten classes (with 40 examples per class).

Step 1: Explore dataset

This tutorial uses the preloaded ESC10 audio demo dataset, which contains 400 audios, each depicting the sound of a chainsaw, dog, helicopter, rain, rooster, etc. Let's quickly explore the dataset.

  1. On the H2O Label Genie navigation menu, click Datasets.
  2. In the Datasets table, click esc10-audio-demo.

Dataset details esc10-audio-demo recordings of 5 seconds each

Step 2: Create an annotation task

Now that we have seen the dataset let's create an annotation task that enables you to annotate the dataset. For this tutorial, an audio classification annotation task refers to assigning a categorical target label to an audio clip.

  1. Click New annotation task.
  2. In the Task name box, enter tutorial-1c.
  3. In the Task description box, enter Annotate a dataset containing samples of environmental sounds.
  4. In the Select task list, select Classification.
  5. Click Create task.

Step 3: Specify an annotation task rubric

Before annotating our dataset, we need to specify an annotation task rubric. An annotation task rubric refers to the labels (for example, object classes) you want to use when annotating your dataset. For our dataset, the following are the multiple categorical target labels we want to specify:

  • chainsaw
  • clock_tick
  • crackling_fire
  • crying_baby
  • dog
  • helicopter
  • rain
  • rooster
  • sea_waves
  • sneezing

Let's define the annotation task rubric.

  1. In the New class name box, enter chainsaw.
  2. Click Add.
  3. Click Add class.
  4. In the New class name box, enter clock_tick.
  5. Click Add.
  6. Repeat the above steps until you create all labels.
  7. Click Continue to annotate.

audio-classification-add-classes.png

note

H2O Label Genie supports multi-label audio classification annotation tasks.

Step 4: Annotate dataset

In the Annotate tab, you can individually annotate each audio clip in the dataset. Let's annotate the first audio.

  1. Select the label that you associate with the sound audio you hear (for example, sneezing). Selecting a label
  2. Click Save and next.
    Note
    • Save and next saves the annotated audio
    • To skip an audio clip to annotate later: Click Skip.
      • Skipped audio clips (samples) reappear after all non-skipped audios are annotated
  3. Annotate all dataset samples.
    note

    At any point in an annotation task, you can download the already annotated (approved) samples. You do not need to fully annotate an imported dataset to download already annotated samples. To learn more, see Download an annotated dataset

Summary

In this tutorial, we learned the process of annotating and specifying an annotation task rubric for an audio classification annotation task.

Next

To learn the process of annotating and specifying an annotation task rubric for other various annotation tasks in computer vision (CV), natural language processing (NLP), and audio, see Tutorials.


Feedback