Skip to main content
Version: v0.8.x

Using Data in H2O Driverless AI

This page guides you through using data in H2O Drive, in your H2O Driverless AI (DAI) instance.

Prerequisites

Before you begin, you will need to:

There are two methods to access the data in H2O Drive on your H2O Driverless AI instance. You can use either one, depending on what works best for you:

  • Export the dataset to H2O Driverless AI: This method involves exporting your data directly from H2O Drive to your H2O Driverless AI instance without needing to start the DAI instance. This is the easiest method to export your data to H2O Driverless AI; however, you can only export one dataset file at a time using this method. If you have multiple datasets and directories to export, you can use the second method mentioned below.

  • Import data via H2O Driverless AI: This method requires starting up your H2O Driverless instance and importing the data from H2O Drive via DAI. You can use this method to export multiple datasets at a time.

Export dataset to H2O DAI

Once you have imported a dataset to H2O Drive, you will see it listed on the Datasets page.

  1. Click Kebab menu and click Export to Driverless AI.

    export-to-dai

    info

    Note that you can only export files to H2O Driverless AI but not directories.

  2. Specify the H2O Driverless AI instance that you wish to export the dataset to.

    select-dai-instance

  3. Once the export is successful, you will see the following success message. Click Open DAI Engine to view the dataset in H2O Driverless AI.

    export-to-dai-sucess

Import data via H2O Driverless AI

Step 1: launch Driverless AI instance

  1. Navigate to H2O AI Cloud on your browser and log in.
  2. To start creating a new AI engine, click MY AI ENGINES. This opens up the H2O Enterprise Steam home page.
  3. Click Driverless AI on the left side panel, and then click Launch Instance.
  4. Select the default-driverless-kubernetes profile and enter the following details to create your new instance of Driverless AI:
    • Instance Name: Enter a suitable name for your new Driverless AI instance.
    • Version: Select the latest version that is available. You can leave the default values as they are for the rest of the fields for this tutorial. For more information about these configurations, click the ? icon at the end of each row to view details about that field or see Launching Instance in the Enterprise Steam documentation.
  5. Click Launch Instance. The instance takes a few minutes to start up. Once it has started up, the instance is listed on the page with the status indicating that the instance is Running. DAI Instance
  6. Select an instance to build a model in by clicking the name of the instance on the Driverless AI Instances page. The Driverless AI home page is displayed.

Step 2: add dataset

  1. Click DATASETS on the top navbar.
  2. Click + ADD DATASET and select H2O DRIVE. add-dataset-dai
  3. Select the datasets you want to use within H2O Driverless AI. You can select multiple datasets to import, and you can use the search bar to find any particular dataset that you are looking for. select-datasets-dai
  4. Once you have selected the dataset(s), click CLICK TO IMPORT SELECTION.

You have successfully added the datasets to Driverless AI. You should be able to see it listed on the Datasets page. You can now modify the dataset, split the dataset, or create a project or experiment in Driverless AI using the imported dataset(s).


Feedback