Zero-shot learning models
Overview
By default, H2O Label Genie utilizes certain zero-shot learning models to accelerate the labeling process. In particular, H2O Label Genie lets you use a zero-shot learning model for the following supported annotation tasks:
- Image classification
- Object detection
- Image instance segmentation
- Text classification
- Text summarization
A zero-shot learning model can be turned On or Off at any time. To learn more, see Smart annotation.
What are zero-shot learning models?
Labeled data is crucial for supervised learning problem types in computer vision (CV), natural language processing (NLP), and audio. High-quality labeled data usually requires a lot of manual labeling that can lead to high costs and delay production or execution.
One way to accelerate the labeling process is to utilize zero-shot learning models. These models let data scientists label unlabeled data with high accuracy and speed. Zero-shot learning models are pre-trained models that have been trained on vast and distinct classes. As a result, zero-shot learning models with prior knowledge can label unlabeled data.
What are zero-shot learning model predictions?
The labels or suggested labels for a given sample that are provided by a zero-shot learning model are called zero-shot learning model predictions. For example, for an image and text classification annotation task, H2O Label Genie, with a zero-shot learning model activated, offers a percentage probability of an image or text belonging to a certain label (class). For an object detection annotation task, it populates an image with bounding boxes where the bounding boxes capture the desired objects (for example, a car).
Annotation tasks + zero-shot learning models
Image classification
For image classification annotation tasks, H2O Label Genie utilizes the OpenCLIP zero-shot learning model. OpenCLIP is an adaptation of OpenAI's Contrastive Language-Image Pre-training (CLIP).
- To learn more, see Open-clip.
Object detection
For object detection annotation tasks, H2O Label Genie utilizes the Detic zero-shot learning model.
- To learn more, see Detecting Twenty-thousand Classes using Image-level Supervision.
Image instance segmentation
For image instance segmentation annotation tasks, H2O Label Genie utilizes the Detic zero-shot learning model.
- To learn more, see Detecting Twenty-thousand Classes using Image-level Supervision.
Text classification
For text classification annotation tasks, H2O Label Genie utilizes the bart-large-mnli zero-shot learning model.
- To learn more, see Bart-large-mnli.
Text summarization
For text summarization annotation tasks, H2O Label Genie utilizes the following zero-shot learning models: bart-large-cnn, distilbart-cnn-12-6, and pegasus-large.
- To learn more, see Bart-large-cnn.
- To learn more, see Distilbart-cnn-12-6.
- To learn more, see Pegasus-large.
- Submit and view feedback for this page
- Send feedback about H2O Label Genie to cloud-feedback@h2o.ai