Version: v1.2.0

Merge datasets

H2O Hydrogen Torch enables you to merge imported datasets into one.

Note
  • H2O Hydrogen Torch does not merge datasets in the sense of combining rows and removing duplicate rows. Merge, in this case, means moving the files of one dataset into another dataset that already contains its own files.
  • H2O Hydrogen Torch cannot merge a dataset if it's already associated with an experiment.
Example

Consider the following two datasets before merging (dataset one and dataset two):

dataset_one.zip                            dataset_two.zip
├───csv_one.csv                            ├───csv_two.csv
│                                          │
└───image_folder_one                       └───image_folder_two
    ├───name_of_image.image_extension          ├───name_of_image.image_extension
    ├───name_of_image.image_extension          ├───name_of_image.image_extension
    ├───name_of_image.image_extension          ├───name_of_image.image_extension
    ...                                        ...

After merging dataset one into dataset two:

merged_dataset_one_and_two.zip
├───csv_two.csv
├───csv_one.csv
│
├───image_folder_two
│   ├───name_of_image.image_extension
│   ├───name_of_image.image_extension
│   ├───name_of_image.image_extension
│   ...
└───image_folder_one
    ├───name_of_image.image_extension
    ├───name_of_image.image_extension
    ├───name_of_image.image_extension
    ...
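
The file-level meaning of "merge" can be illustrated with a minimal Python sketch. This is not how H2O Hydrogen Torch implements merging; it is an assumption-laden illustration using only the standard library. The helper names (`make_zip`, `list_files`, `merge_zip_files`) and the sample file contents are hypothetical. The sketch copies every file of both archives into a new archive, without combining CSV rows or removing duplicate rows.

```python
import io
import zipfile


def make_zip(files: dict) -> bytes:
    """Build an in-memory zip archive from a {file name: bytes} mapping."""
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w") as zf:
        for name, data in files.items():
            zf.writestr(name, data)
    return buf.getvalue()


def list_files(archive: bytes) -> list:
    """Return the file names stored in an in-memory zip archive."""
    with zipfile.ZipFile(io.BytesIO(archive)) as zf:
        return zf.namelist()


def merge_zip_files(source: bytes, target: bytes) -> bytes:
    """Return a new archive with all files of target, then all files of source.

    Files are copied unchanged: CSV rows are not combined or deduplicated.
    """
    merged = io.BytesIO()
    seen = set()
    with zipfile.ZipFile(merged, "w") as out:
        for archive in (target, source):
            with zipfile.ZipFile(io.BytesIO(archive)) as zf:
                for name in zf.namelist():
                    if name in seen:
                        # Name clashes are outside the scope of this sketch,
                        # so it stops instead of guessing a resolution.
                        raise ValueError(f"duplicate file name: {name}")
                    seen.add(name)
                    out.writestr(name, zf.read(name))
    return merged.getvalue()


# Hypothetical sample data mirroring the example layout above.
dataset_one = make_zip({
    "csv_one.csv": b"image,label\na.png,0\n",
    "image_folder_one/a.png": b"fake image bytes",
})
dataset_two = make_zip({
    "csv_two.csv": b"image,label\na.png,0\n",
    "image_folder_two/a.png": b"fake image bytes",
})

merged = merge_zip_files(source=dataset_one, target=dataset_two)
for name in list_files(merged):
    print(name)
```

Both CSV files survive as separate files in the merged archive, even though their rows are identical here, which matches the note that rows are neither combined nor deduplicated.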

Instructions

To merge an imported dataset (A) into another dataset (B), follow these steps:

  1. In the H2O Hydrogen Torch navigation menu, click View datasets.
  2. In the datasets table, locate the dataset (A) that you want to merge into another dataset.
  3. Click the Kebab menu of that dataset.
  4. Click Edit dataset.
  5. Click Merge with existing dataset.
  6. In the Dataset list, select the dataset (B) that will receive the files of the dataset you selected in step 2 (A).
  7. Click Merge.
  8. Configure the dataset settings.
    Note

    To learn about the import dataset settings, see Import dataset settings.

  9. Click Continue.
  10. Again, click Continue.
    Note

    Before you click Continue, review the dataset preview visualizations.

