Version: v0.4.0

Specify an annotation task rubric

Overview

After creating a new annotation task, specify an annotation task rubric in the Rubric tab. An annotation task rubric refers to the labels (for example, object classes) to use when annotating a dataset. For example, after creating a new annotation task for an object detection dataset, you have to specify the object classes to use when labeling the dataset in the annotation task rubric.

Instructions

An annotation task rubric differs based on the specified task type of the dataset used to create the annotation task.

Text annotation tasks

Text classification
Text regression
Text-entity recognition
Text summarization
Text-generative AI

Instructions: Specify one or more categorical target labels for a text classification task rubric.
Example: To specify happy and unhappy as labels, one can consider the following instructions in the Rubric tab of the annotation task:

note

To learn how to access the Rubric tab of an annotation task (or other tabs), see Access an annotation task's tabs.

In the New class name box, enter happy.
Click Add.
Click Add class.
In the New class name box, enter unhappy.
Click Add.

Text classification annotation task

note

To learn more, see Tutorial 1A: Text classification annotation task.

Instructions: You need to specify one continuous target label for a text regression task rubric.
Example: To create one continuous target label from 1 to 5, one can consider the following instructions in the Rubric tab of the annotation task:

note

To learn how to access the Rubric tab of an annotation task (or other tabs), see Access an annotation task's tabs.

In the Data minimum value box, enter 1.
- The Data minimum value value refers to the minimum value in your continuous values (star ratings from 1 to 5)
In the Data maximum value box, enter 5.
- The Data maximum value value refers to the maximum value in your continuous values (star ratings from 1 to 5)
In the Data step size (interval) box, enter 1.
- The Data step size (interval) value refers to the value the label range slider interval takes (the slider is used in the next step to label a review)
Click Apply.

Text regression annotation task

note

To learn more, see Tutorial 2A: Text regression annotation task.

Instructions: You need to specify one or more defined entities.
Example: To create a product and emotion entity, one can consider the following instructions in the Rubric tab of the annotation task:

note

To learn how to access the Rubric tab of an annotation task (or other tabs), see Access an annotation task's tabs.

In the New object name box, enter product.
Click Add.
Click Add entity.
In the New object name box, enter emotion.

Text-entity recognition rubric

note

To learn more, see Tutorial 3A: Text-entity recognition annotation task.

Instructions: You need to specify a zero-shot learning model and a maximum target length.
Example: To specify a zero-shot learning model and a maximum target length, one can consider the following instructions in the Rubric tab of the annotation task:

note

To learn how to access the Rubric tab of an annotation task (or other tabs), see Access an annotation task's tabs.

In the Select model box, select sshleifer/distilbart-cnn-12-6.
- The Select model value refers to the zero-shot learning model to utilize in your annotation task. To learn more, see Annotation tasks + zero-shot learning models: Text summarization
In the Max target length box, enter 128.
- The Max target length value refers to the minimum character length of your summaries

Text summarization rubric

note

To learn more, see Tutorial 4A: Text summarization annotation task.

Image annotation tasks

Image classification
Image regression
Object detection
Image instance segmentation

For an image classification task rubric, you need specify one or more categorical target labels in the annotation task rubric for an image classification annotation task. To learn more, see Tutorial 1B: Annotation task: Image classification

Instructions: Specify one or more categorical target labels.
Example: To create a car and coffee label, one can consider the following instructions in the Rubric tab of the annotation task:

note
To learn how to access the Rubric tab of an annotation task (or other tabs), see Access an annotation task's tabs.
1. In the New class name box, enter car.
2. Click Add.
3. Click Add class.
4. In the New class name box, enter coffee.
5. Click Add.

Image classification rubric

note

To learn more, see Tutorial 1B: Image classification annotation task.

Instructions: Specify one or more object classes (labels).
Example: To specify car and coffee as a label, one can consider the following instructions in the Rubric tab of the annotation task:

note

To learn how to access the Rubric tab of an annotation task (or other tabs), see Access an annotation task's tabs.

In the New object name box, enter car.
Click Add.
Click Add object class.
In the New object name box, enter coffee.
Click Add.

Image object detection annotation task rubric

note

To learn more, see Tutorial 3B: Object detection annotation task.

You need to specify one or more object classes (labels) in the annotation task rubric for an image instance segmentation annotation task. To learn more, see Tutorial 4B: Annotation task: Image instance segmentation.

Instructions: Specify one or more object classes (labels).
Example: To specify car and coffee as object classes, one can consider the following instructions in the Rubric tab of the annotation task:

note

To learn how to access the Rubric tab of an annotation task (or other tabs), see Access an annotation task's tabs.

In the New object name box, enter car.
Click Add.
Click Add object class.
In the New object name box, enter coffee.
Click Add.

Image instance segmentation annotation task rubric

note

To learn more, see Tutorial 4B: Image instance segmentation annotation task.

Audio annotation tasks

Audio classification
Audio regression

Instructions: Specify one or more categorical target labels.
Example: To specify chainsaw and clock_tick as labels, one can consider the following instructions in the Rubric tab of the annotation task:

note

To learn how to access the Rubric tab of an annotation task (or other tabs), see Access an annotation task's tabs.

In the New class name box, enter chainsaw.
Click Add.
Click Add class.
In the New class name box, enter clock_tick.
Click Add.

Audio classification annotation task rubric

note

To learn more, see Tutorial 1C: Audio classification annotation task.

Instructions: Specify one continuous target label.
Example: To create one continuous target label from 0 to 9, one can consider the following instructions in the Rubric tab of the annotation task:

note

To learn how to access the Rubric tab of an annotation task (or other tabs), see Access an annotation task's tabs.

In the Data minimum value box, enter 0.
- The Data minimum value refers to the minimum value in your continuous values (in this case, digits ranging from 0 to 9)
In the Data maximum value box, enter 9.
- The Data maximum value refers to the maximum value in your continuous values (in this case, digits ranging from 0 to 9)
In the Data step size (interval) box, enter 1.
- The Data step size (interval) value refers to the value the label range slider interval takes
Click Apply.

Audio regression annotation task rubric

note

To learn more, see Tutorial 2C: Audio regression annotation task.

Large language model (LLM) parameters

Select model family

Defines the zero-shot learning model family (large language model (LLM)) to utilize for the text-generative AI annotation task.

Details

Options

h2oGPT
- This option enables available h2oGPT large language models (LLMs) to utilize for a text-generative AI annotation task
h2oGPT Custom
- This option enables available h2oGPT large language models (LLMs) for a text-generative AI annotation task
OpenAI
- This option enables available OpenAI large language models (LLMs) in your OpenAI account for a text-generative AI annotation task. To connect to your OpenAI LLMs, see OpenAI API settings

LLM model name

Defines the zero-shot learning model name (large language model (LLM)) to utilize for a text-generative AI annotation task.

Max response tokens

Defines the maximum number of tokens for a response; a low number can result in short responses, which might limit the responses.

Temperature

Defines the randomness of predictions by scaling the logits. Higher temperature values increase creativity on the part of the model while producing more diverse outputs. In other words, the temperature makes the distribution more random.

Repetition penalty

Defines the penalty value of tokens frequently reappearing in the text (response). For example, a token that has already appeared ten times can be penalized more than a token that has appeared only two times. A 1.0 value means no penalty.

tip

This setting can be helpful when attempting to reduce the model's tendency to generate verbatim/identical text.

Prompt template

Select example prompt

This setting defines the input format for the selected model. There are several options, including the option to create your own custom input format (custom).

Feedback

Submit and view feedback for this page
Send feedback about H2O Label Genie to cloud-feedback@h2o.ai

Overview​

Instructions​

Text annotation tasks​

Image annotation tasks​

Audio annotation tasks​

Large language model (LLM) parameters​

Select model family​

LLM model name​

Max response tokens​

Temperature​

Repetition penalty​

Prompt template​

Select example prompt​

Overview

Instructions

Text annotation tasks

Image annotation tasks

Audio annotation tasks

Large language model (LLM) parameters

Select model family

LLM model name

Max response tokens

Temperature

Repetition penalty

Prompt template

Select example prompt