Version: v0.4.0

Tutorial 1C: Audio classification annotation task

Overview

This tutorial describes the process of creating an audio classification annotation task, including specifying an annotation task rubric for it. To highlight the process, we will annotate a dataset containing 5-second-long recordings of environmental sounds organized into ten classes (with 40 examples per class).

Step 1: Explore dataset

This tutorial uses the preloaded ESC10 audio demo dataset, which contains 400 audios, each depicting the sound of a chainsaw, dog, helicopter, rain, rooster, etc. Let's quickly explore the dataset.

On the H2O Label Genie navigation menu, click Datasets.
In the Datasets table, click esc10-audio-demo.

Dataset details esc10-audio-demo recordings of 5 seconds each

Step 2: Create an annotation task

Now that we have seen the dataset let's create an annotation task that enables you to annotate the dataset. For this tutorial, an audio classification annotation task refers to assigning a categorical target label to an audio clip.

Click New annotation task.
In the Task name box, enter tutorial-1c.
In the Task description box, enter Annotate a dataset containing samples of environmental sounds.
In the Select task list, select Classification.
Click Create task.

Step 3: Specify an annotation task rubric

Before annotating our dataset, we need to specify an annotation task rubric. An annotation task rubric refers to the labels (for example, object classes) you want to use when annotating your dataset. For our dataset, the following are the multiple categorical target labels we want to specify:

chainsaw
clock_tick
crackling_fire
crying_baby
dog
helicopter
rain
rooster
sea_waves
sneezing

Let's define the annotation task rubric.

In the New class name box, enter chainsaw.
Click Add.
Click Add class.
In the New class name box, enter clock_tick.
Click Add.
Repeat the above steps until you create all labels.
Click Continue to annotate.

note

H2O Label Genie supports multi-label audio classification annotation tasks.

Step 4: Annotate dataset

In the Annotate tab, you can individually annotate each audio clip in the dataset. Let's annotate the first audio.

Select the label that you associate with the sound audio you hear (for example, sneezing).
Click Save and next.
Note
- Save and next saves the annotated audio
- To skip an audio clip to annotate later: Click Skip.
  - Skipped audio clips (samples) reappear after all non-skipped audios are annotated
Annotate all dataset samples.
note
At any point in an annotation task, you can download the already annotated (approved) samples. You do not need to fully annotate an imported dataset to download already annotated samples. To learn more, see Download an annotated dataset

Summary

In this tutorial, we learned the process of annotating and specifying an annotation task rubric for an audio classification annotation task.

To learn the process of annotating and specifying an annotation task rubric for other various annotation tasks in computer vision (CV), natural language processing (NLP), and audio, see Tutorials.

Feedback

Submit and view feedback for this page
Send feedback about H2O Label Genie to cloud-feedback@h2o.ai

Overview​

Step 1: Explore dataset​

Step 2: Create an annotation task​

Step 3: Specify an annotation task rubric​

Step 4: Annotate dataset​

Summary​

Next​