Import dataset settings: Image metric learning
Dataset name
Name of the dataset.
Problem type
Defines the problem type of the experiment.
- The selected problem type and experience level determine the settings H2O Hydrogen Torch displays for the experiment.
- The From experiment option allows you to use the settings from a previously run experiment.
Train dataframe
Defines a .csv or .pq file containing a dataframe with training records that H2O Hydrogen Torch will use to train the model.
- The records will be combined into mini-batches when training the model.
- If a validation dataframe is provided, a fold column is not needed in the train dataframe.
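As an illustration, a train dataframe could be assembled as in the minimal sketch below. The column names `image`, `label`, and `fold` are hypothetical, as is the round-robin fold assignment; the sketch uses only the Python standard library and writes .csv text (a .pq file would need a Parquet library, which is not shown here).

```python
import csv
import io


def build_train_rows(image_names, labels, n_folds=5):
    """Return one record per image with a round-robin fold id (an assumption)."""
    if len(image_names) != len(labels):
        raise ValueError("image_names and labels must have the same length")
    return [
        {"image": name, "label": label, "fold": i % n_folds}
        for i, (name, label) in enumerate(zip(image_names, labels))
    ]


def rows_to_csv(rows, fieldnames):
    """Serialize the records to CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


# Hypothetical image names and labels, for illustration only.
rows = build_train_rows(["a.jpg", "b.jpg", "c.jpg"], ["cat", "dog", "cat"], n_folds=2)
train_csv = rows_to_csv(rows, ["image", "label", "fold"])
print(train_csv)
```

If a separate validation dataframe is supplied, the `fold` column in this sketch can simply be left out.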
Data folder
Defines the folder location of the assets (e.g., images or audio clips) the model utilizes for training. H2O Hydrogen Torch loads assets from this folder during training.
Validation dataframe
Defines a .csv or .pq file containing a dataframe with validation records that H2O Hydrogen Torch will use to evaluate the model during training.
- Setting a Validation dataframe requires the Validation strategy to be set to Custom holdout validation. In this case, H2O Hydrogen Torch uses the separate validation dataframe as provided and does not perform any internal cross-validation: the model is trained on the full train dataframe, and model performance is evaluated on the validation dataframe.
- The validation dataframe should have the same format as the train dataframe but does not require a fold column.
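One way to prepare a separate validation dataframe is to hold out a share of the available records before importing. The sketch below is an assumption about how you might do this outside H2O Hydrogen Torch, not a description of what the product does internally; the column names and the 20% holdout fraction are hypothetical.

```python
import random


def split_holdout(rows, holdout_fraction=0.2, seed=0):
    """Shuffle the records and split them into train and validation lists."""
    if not 0.0 < holdout_fraction < 1.0:
        raise ValueError("holdout_fraction must be between 0 and 1")
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)  # seeded for reproducibility
    n_valid = max(1, round(len(shuffled) * holdout_fraction))
    # The validation records do not need a fold column, so drop it here.
    valid = [
        {key: value for key, value in row.items() if key != "fold"}
        for row in shuffled[:n_valid]
    ]
    train = shuffled[n_valid:]
    return train, valid


# Hypothetical records, for illustration only.
records = [{"image": f"img_{i}.jpg", "label": i % 3, "fold": i % 5} for i in range(10)]
train_rows, valid_rows = split_holdout(records, holdout_fraction=0.2, seed=42)
print(len(train_rows), len(valid_rows))
```

Both lists can then be written to separate files and selected as the train and validation dataframes.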
Test dataframe
Defines a .csv or .pq file containing a dataframe with test records that H2O Hydrogen Torch will use to test the model.
The test dataframe should have the same format as the train dataframe but does not require a label column.
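A quick pre-import check can confirm that a test dataframe carries the same columns as the train dataframe, apart from the label column(s). The sketch below is a hypothetical helper; the column names are illustrative, and treating a `fold` column as optional in the test dataframe is an assumption.

```python
def check_test_columns(train_columns, test_columns, label_columns, optional=("fold",)):
    """Return the train columns that are missing from the test dataframe.

    Label columns and the columns listed in `optional` are not required.
    """
    skipped = set(label_columns) | set(optional)
    required = [column for column in train_columns if column not in skipped]
    return [column for column in required if column not in test_columns]


# Hypothetical column names, for illustration only.
train_columns = ["image", "label", "fold"]
missing_ok = check_test_columns(train_columns, ["image"], label_columns=["label"])
missing_bad = check_test_columns(train_columns, ["filename"], label_columns=["label"])
print(missing_ok)   # []
print(missing_bad)  # ['image']
```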
Data folder test
Defines the folder location of the assets (e.g., images or texts) H2O Hydrogen Torch will use to test the model. H2O Hydrogen Torch loads the assets from this folder when testing the model. This setting appears only when you specify a test dataframe in the Test dataframe setting.
Label columns
Defines the name(s) of the dataframe column(s) that refer to the target value(s) H2O Hydrogen Torch will aim to predict.
- You can select more than one label column, so the target to predict can span a single column or multiple columns.
- Image classification supports multi-class and multilabel classification.
Image column
Defines the dataframe column storing the names of images that H2O Hydrogen Torch will load from the data folder and data folder test when training and testing the model.
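Because the image column is resolved against the data folder, it can help to verify before importing that every listed image is actually present. The sketch below assumes the column stores file names relative to the data folder, including the file extension; the helper name and the sample files are hypothetical.

```python
import tempfile
from pathlib import Path


def find_missing_images(image_names, data_folder):
    """Return the image names that have no matching file in data_folder."""
    folder = Path(data_folder)
    return [name for name in image_names if not (folder / name).is_file()]


# Demonstration with a temporary folder that contains only one of two images.
with tempfile.TemporaryDirectory() as tmp:
    (Path(tmp) / "a.jpg").write_bytes(b"")
    missing_images = find_missing_images(["a.jpg", "b.jpg"], tmp)

print(missing_images)  # ['b.jpg']
```

The same check can be run against the test data folder with the image names from the test dataframe.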