Tutorial 1C: Audio classification annotation task
Overview
This tutorial describes the process of creating an audio classification annotation task, including specifying an annotation task rubric for it. To highlight the process, we will annotate a dataset containing 5-second-long recordings of environmental sounds organized into ten classes (with 40 examples per class).
Step 1: Explore dataset
This tutorial uses the preloaded ESC10 audio demo dataset, which contains 400 audios, each depicting the sound of a chainsaw, dog, helicopter, rain, rooster, etc. Let's quickly explore the dataset.
- On the H2O Label Genie navigation menu, click Datasets.
- In the Datasets table, click esc10-audio-demo.
Step 2: Create an annotation task
Now that we have seen the dataset let's create an annotation task that enables you to annotate the dataset. For this tutorial, an audio classification annotation task refers to assigning a categorical target label to an audio clip.
- Click New annotation task.
- In the Task name box, enter
tutorial-1c
. - In the Task description box, enter
Annotate a dataset containing samples of environmental sounds
. - In the Select task list, select Classification.
- Click Create task.
Step 3: Specify an annotation task rubric
Before annotating our dataset, we need to specify an annotation task rubric. An annotation task rubric refers to the labels (for example, object classes) you want to use when annotating your dataset. For our dataset, the following are the multiple categorical target labels we want to specify:
- chainsaw
- clock_tick
- crackling_fire
- crying_baby
- dog
- helicopter
- rain
- rooster
- sea_waves
- sneezing
Let's define the annotation task rubric.
- In the New class name box, enter
chainsaw
. - Click Add.
- Click Add class.
- In the New class name box, enter
clock_tick
. - Click Add.
- Repeat the above steps until you create all labels.
- Click Continue to annotate.
H2O Label Genie supports multi-label audio classification annotation tasks.
Step 4: Annotate dataset
In the Annotate tab, you can individually annotate each audio clip in the dataset. Let's annotate the first audio.
- Select the label that you associate with the sound audio you hear (for example, sneezing).
- Click Save and next. Note
- Save and next saves the annotated audio
- To skip an audio clip to annotate later: Click Skip.
- Skipped audio clips (samples) reappear after all non-skipped audios are annotated
- Annotate all dataset samples. note
At any point in an annotation task, you can download the already annotated (approved) samples. You do not need to fully annotate an imported dataset to download already annotated samples. To learn more, see Download an annotated dataset
Summary
In this tutorial, we learned the process of annotating and specifying an annotation task rubric for an audio classification annotation task.
Next
To learn the process of annotating and specifying an annotation task rubric for other various annotation tasks in computer vision (CV), natural language processing (NLP), and audio, see Tutorials.
- Submit and view feedback for this page
- Send feedback about H2O Label Genie to cloud-feedback@h2o.ai