Version: v1.4.0

Merge datasets

Overview

H2O Hydrogen Torch enables you to merge imported datasets into one.

Note

H2O Hydrogen Torch does not merge datasets in the sense of combining rows and removing duplicate rows. Merge, in this case, refers to bringing the dataset files of one dataset into a single dataset (another dataset) that already contains dataset files of its own.

Caution

H2O Hydrogen Torch cannot merge a dataset if it's already associated with an experiment.

Example

Consider the following two datasets before merging (dataset one and dataset two):

dataset_one.zip                            dataset_two.zip
├───csv_one.csv                            ├───csv_two.csv
│                                          │
└───image_folder_one                       └───image_folder_two
    ├───name_of_image.image_extension          ├───name_of_image.image_extension
    ├───name_of_image.image_extension          ├───name_of_image.image_extension
    ├───name_of_image.image_extension          ├───name_of_image.image_extension
    └───...                                    └───...

After merging dataset one into dataset two, the following new dataset is created:

merged_dataset_one_and_two.zip
├───csv_two.csv
├───csv_one.csv
│
├───image_folder_two
│   ├───name_of_image.image_extension
│   ├───name_of_image.image_extension
│   ├───name_of_image.image_extension
│   └───...
└───image_folder_one
    ├───name_of_image.image_extension
    ├───name_of_image.image_extension
    ├───name_of_image.image_extension
    └───...
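
The file-level behavior shown in this example can be approximated outside the product with a short Python sketch. This is not how H2O Hydrogen Torch implements merging; it only illustrates the idea that files from both archives end up side by side and that no CSV rows are combined or deduplicated. The function names `make_zip` and `merge_zip_archives` are hypothetical, and raising an error on a file-name clash is an assumption, since this page does not describe how name clashes are handled.

```python
import io
import zipfile


def make_zip(files: dict) -> bytes:
    """Build a small in-memory ZIP from a {name: bytes} mapping (demo helper)."""
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w") as zf:
        for name, data in files.items():
            zf.writestr(name, data)
    return buf.getvalue()


def merge_zip_archives(first: bytes, second: bytes) -> bytes:
    """Return a new ZIP holding every file from both archives, unchanged.

    Files are copied as-is: CSV contents are never joined or deduplicated.
    Assumption: a name that appears in both archives is treated as an error.
    """
    merged = io.BytesIO()
    seen = set()
    with zipfile.ZipFile(merged, "w", zipfile.ZIP_DEFLATED) as out:
        # Copy the files of the target archive first, then the other one.
        for payload in (second, first):
            with zipfile.ZipFile(io.BytesIO(payload)) as src:
                for info in src.infolist():
                    if info.filename in seen:
                        raise ValueError(f"name clash: {info.filename}")
                    seen.add(info.filename)
                    out.writestr(info, src.read(info))
    return merged.getvalue()


dataset_one = make_zip({
    "csv_one.csv": b"image,label\na.png,0\n",
    "image_folder_one/a.png": b"fake-image-bytes",
})
dataset_two = make_zip({
    "csv_two.csv": b"image,label\na.png,0\n",
    "image_folder_two/a.png": b"fake-image-bytes",
})

merged = merge_zip_archives(dataset_one, dataset_two)
with zipfile.ZipFile(io.BytesIO(merged)) as zf:
    merged_names = zf.namelist()
    print(merged_names)
```

Both CSV files survive as separate files even though their rows are identical, which mirrors the note above: the merge brings files together and leaves their contents alone.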

Instructions

To merge an imported dataset (A) into another dataset (B), follow these steps:

  1. In the H2O Hydrogen Torch navigation menu, click View datasets.
  2. In the datasets table, locate the row of the dataset you want to merge into another dataset (A).
  3. Click the Kebab menu.
  4. Click Edit dataset.
  5. Click Merge with existing dataset.
  6. In the Dataset list, select the dataset (B) to merge with the previously selected dataset (A).
  7. Click Merge.
  8. Configure the dataset settings.
    Note

    To learn about the import dataset settings, see Import dataset settings.

  9. Click Continue.
  10. Again, click Continue.
    Note

    Before you click Continue, please review the dataset preview visualizations.

