Merge datasets
H2O Hydrogen Torch enables you to merge imported datasets into one.
Note
- Merging in H2O Hydrogen Torch does not combine rows or remove duplicate rows. Instead, merging brings the files of one dataset into another dataset that already contains its own files, so that both sets of files live in a single dataset.
- H2O Hydrogen Torch cannot merge a dataset if it's already associated with an experiment.
Example
Consider the following two datasets before merging (dataset one and dataset two):
dataset_one.zip
│   └───csv_one.csv
│   │
│   └───image_folder_one
│       └───name_of_image.image_extension
│       └───name_of_image.image_extension
│       └───name_of_image.image_extension
│       ...

dataset_two.zip
│   └───csv_two.csv
│   │
│   └───image_folder_two
│       └───name_of_image.image_extension
│       └───name_of_image.image_extension
│       └───name_of_image.image_extension
│       ...
After merging dataset one into dataset two:
merged_dataset_one_and_two.zip
│   └───csv_two.csv
│   └───csv_one.csv
│   │
│   └───image_folder_two
│   │   └───name_of_image.image_extension
│   │   └───name_of_image.image_extension
│   │   └───name_of_image.image_extension
│   │   ...
│   └───image_folder_one
│       └───name_of_image.image_extension
│       └───name_of_image.image_extension
│       └───name_of_image.image_extension
│       ...
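
The file-level meaning of "merge" shown in this example can be sketched in Python. This is only an illustration of the idea, not how H2O Hydrogen Torch implements merging: the helpers `merge_zip_files` and `make_zip` are hypothetical, the file contents are made up, and raising an error on a file-name collision is an assumption, because this page does not say how collisions are handled.

```python
import io
import zipfile


def merge_zip_files(source_zip: bytes, target_zip: bytes) -> bytes:
    """Copy every file from both archives into one new archive.

    Files are copied unchanged: CSV rows are not concatenated and no
    duplicates are removed. Hypothetical helper for illustration only.
    """
    merged = io.BytesIO()
    seen = set()
    with zipfile.ZipFile(merged, "w") as out:
        # Write the target's files first, then the source's files.
        for archive_bytes in (target_zip, source_zip):
            with zipfile.ZipFile(io.BytesIO(archive_bytes)) as archive:
                for name in archive.namelist():
                    if name in seen:
                        # Assumption: refuse to overwrite on a name collision.
                        raise ValueError(f"file name collision: {name}")
                    seen.add(name)
                    out.writestr(name, archive.read(name))
    return merged.getvalue()


def make_zip(files: dict) -> bytes:
    """Build a small in-memory zip archive from {path: text} pairs."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as archive:
        for name, content in files.items():
            archive.writestr(name, content)
    return buffer.getvalue()


# Made-up stand-ins for dataset one and dataset two.
dataset_one = make_zip({
    "csv_one.csv": "image,label\na.png,0\n",
    "image_folder_one/a.png": "placeholder image bytes",
})
dataset_two = make_zip({
    "csv_two.csv": "image,label\nb.png,1\n",
    "image_folder_two/b.png": "placeholder image bytes",
})

merged_bytes = merge_zip_files(dataset_one, dataset_two)
with zipfile.ZipFile(io.BytesIO(merged_bytes)) as merged_archive:
    merged_names = merged_archive.namelist()
    merged_csv_one = merged_archive.read("csv_one.csv").decode()
```

Both CSV files and both image folders end up side by side in the merged archive, and each CSV file keeps exactly the rows it had before the merge.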
Instructions
To merge an imported dataset (A) into another dataset (B), follow these steps:
- In the H2O Hydrogen Torch navigation menu, click View datasets.
- In the datasets table, locate the dataset (A) that you want to merge into another dataset.
- Click the Kebab menu for that dataset.
- Click Edit dataset.
- Click Merge with existing dataset.
- In the Dataset list, select the dataset (B) that will receive the files of the dataset you selected above (A).
- Click Merge.
- Configure the dataset settings.
  Note: To learn about the import dataset settings, see Import dataset settings.
- Click Continue.
- Review the dataset preview visualizations, and then click Continue again.