Skip to main content
Version: v1.3.0

Supported problem types

Overview

H2O Hydrogen Torch supports an array of diverse problem types in computer vision, natural language, and audio, and each, in turn, is explained below.

Image

Image regression

  • Description: Image regression refers to assigning one or more continuous target labels to an input image.
  • Supported classification tasks: Multi-label.
note

To learn about the available settings for this problem type, see Experiment settings: Image regresssion.

3D image regression

  • Description: 3D image regression refers to assigning one or more continuous target labels to 3D images (inputs).
  • Supported classification tasks: Multi-label.
note

To learn about the available settings for this problem type, see see Experiment settings: 3D image regresssion.

Image classification

  • Description: Image classification refers to assigning one or more categorical target labels to an input image.
  • Supported classification tasks: Binary, multi-class, and multi-label.
note

To learn about the available settings for this problem type, see Experiment settings: Image classification.

3D image classification

  • Description: 3D image classification refers to assigning one or more categorical target labels to 3D images (inputs).
  • Supported classification tasks: Binary, multi-class, and multi-label.
note

To learn about the available settings for this problem type, see Experiment settings: 3D image classification.

Image object detection

  • Description: Image object detection refers to locating individual objects in an image by drawing bounding boxes around them.
  • Supported classification tasks: Multi-class.
note

To learn about the available settings for this problem type, see Experiment settings: Image object detection.

Image semantic segmentation

  • Description: Image semantic segmentation refers to associating each pixel of an image with a class label (such as phones, pencils, or roads).
  • Supported classification tasks: Multi-class.
note

To learn about the available settings for this problem type, see Experiment settings: Image semantic segmentation.

3D image semantic segmentation

  • Description: 3D image semantic segmentation refers to associating each pixel of a 3D image with a particular class label.
  • Supported classification tasks: Multi-class.
note

To learn about the available settings for this problem type, see Experiment settings: 3D image semantic segmentation.

Image instance segmentation

  • Description: Image instance segmentation refers to locating individual objects in an image by drawing masks around them.
  • Supported classification tasks: Multi-class.
note

To learn about the available settings for this problem type, see Experiment settings: Image instance segmentation.

Image metric learning

  • Description: Image metric learning refers to establishing similarity or dissimilarity between images.
note

To learn about the available settings for this problem type, see Experiment settings: Image metric learning.

Text

Text regression

  • Description: Text regression refers to assigning one or more continuous target labels to an input text.
  • Supported classification tasks: Multi-label.
note

To learn about the available settings for this problem type, see Experiment settings: Text regression.

Text classification

  • Description: Text classification refers to assigning one or more categorical target labels to an input text; this includes binary classification, multi-class classification, and multi-label classification.
  • Supported classification tasks: Binary, multi-class, and multi-label.
note

To learn about the available settings for this problem type, see Experiment settings: Text classification.

Text token classification

  • Description: Text token classification refers to assigning a label to all tokens in a piece of text in contrast to text classification, where the entire text is given a label.
  • supported classification tasks: Binary and multi-class.
note

To learn about the available settings for this problem type, see Experiment settings: Text token classification.

Text span prediction

  • Description: Text span prediction refers to the task of finding a substring in a larger context. A typical problem (task) is question-answering, where given a context and question, the task is to find the answer (substring) in the context.
note

To learn about the available settings for this problem type, see Experiment settings: Text span prediction.

Text sequence to sequence

  • Description: Text sequence to sequence (Seq2seq) refers to the task of predicting an output sequence given an input sequence; in other words, Seq2seq turns one sequence into another (sequence transformation).
note

To learn about the available settings for this problem type, see Experiment settings: Text sequence to sequence.

Text metric learning

  • Description: Text metric learning refers to establishing similarity or dissimilarity between texts.
note

To learn about the available settings for this problem type, see Experiment settings: Text metric learning.

Audio

Audio regression

  • Description: Audio regression refers to assigning one or more continuous target labels to input audio.
  • Supported classification tasks: Multi-label.
note

To learn about the available settings for this problem type, see Experiment settings: Audio regression.

Audio classification

  • Description: Audio classification refers to assigning one or more categorical target labels to input audio.
  • Supported classification tasks: Binary, multi-class, and multi-label.
note

To learn about the available settings for this problem type, see Experiment settings: Audio classification.

Speech

Speech recognition

  • Description: Speech recognition refers to the process of converting input audio to text.
note

To learn about the available settings for this problem type, see Experiment settings: Speech recognition.


Feedback