Install on Azure

This section describes how to install the Driverless AI image from Azure.

Watch the installation video here. Note that some of the images in this video may change between releases, but the installation steps remain the same.

Environment

Provider Instance Type Num GPUs Suitable for
Azure Standard_NV6 1 Experimentation
Standard_NV12 2 Experimentation
Standard_NV24 4 Serious use
Standard_NC6 1 Experimentation
Standard_NC12 2 Experimentation
Standard_NC24 4 Serious use

Installing the Azure Instance

  1. Log in to your Azure portal at https://portal.azure.com, and click the Create a Resource button.
  2. Search for and select H2O DriverlessAI in the Marketplace.
../_images/azure_select_driverless_ai.png
  1. Click Create. This launches the H2O DriverlessAI Virtual Machine creation process.
../_images/azure_search_for_dai.png
  1. On the Basics tab:
  1. Enter a name for the VM.
  2. Select the Disk Type for the VM. Use HDD for GPU instances.
  3. Enter the name that you will use when connecting to the machine through SSH.
  4. Enter and confirm a password that will be used when connecting to the machine through SSH.
  5. Specify the Subscription option. (This should be Pay-As-You-Go.)
  6. Enter a name unique name for the resource group.
  7. Specify the VM region.

Click OK when you are done.

../_images/azure_basics_tab.png
  1. On the Size tab, select your virtual machine size. Specify the HDD disk type and select a configuration. We recommend using an N-Series type, which comes with a GPU. Also note that Driverless AI requires 10 GB of free space in order to run and will stop working of less than 10 GB is available. We recommend a minimum of 30 GB of disk space. Click OK when you are done.
../_images/azure_vm_size.png
  1. On the Settings tab, select or create the Virtual Network and Subnet where the VM is going to be located and then click OK.
../_images/azure_settings_tab.png
  1. The Summary tab performs a validation on the specified settings and will report back any errors. When the validation passes successfully, click Create to create the VM.
../_images/azure_summary_tab.png
  1. After the VM is created, it will be available under the list of Virtual Machines. Select this Driverless AI VM to view the IP address of your newly created machine. Then open a terminal window and ssh into the machine running the VM. Optionally run pwd to retrieve your current location in the VM, and optionally run nvidia-smi to verify that the NVIDIA driver is running.
  2. Use the following command to retrieve the latest Driverless AI version.
wget dai-docker-centos7-x86_64-1.5.0-9.0.tar.gz
  1. Set up the data, log, and license directories on the host machine:
# Set up the data, log, license, and tmp directories on the host machine
cd /etc/h2oai
mkdir data
mkdir log
mkdir license
mkdir tmp
  1. At this point, you can copy data into the data directory on the host machine using scp. For example:
scp <data_file>.csv <username>@<vm_address>:/etc/h2oai/data

The data will be visible inside the Docker container.

  1. Load the Driverless AI Docker image
docker load < dai-docker-centos7-x86_64-1.5.0-9.0.tar.gz
  1. If you selected a GPU machine, then you must enable persistence of the GPU. Note that this needs to be run once every reboot. Refer to the following for more information: http://docs.nvidia.com/deploy/driver-persistence/index.html.
sudo nvidia-persistenced --user <USER>
sudo nvidia-smi -pm 1
  1. Start the Driverless AI Docker image:
# Start the Driverless AI Docker image
nvidia-docker run \
    --pid=host \
    --init \
    --rm \
    --shm-size=256m \
    -u `id -u`:`id -g` \
    -p 12345:12345 \
    -v `pwd`/data:/data \
    -v `pwd`/log:/log \
    -v `pwd`/license:/license \
    -v `pwd`/tmp:/tmp \
    h2oai/dai-centos7-x86_64:1.5.0-9.0

Driverless AI will begin running:

--------------------------------
Welcome to H2O.ai's Driverless AI
---------------------------------
     version: 1.5.0

- Put data in the volume mounted at /data
- Logs are written to the volume mounted at /log/20180606-044258
- Connect to Driverless AI on port 12345 inside the container
- Connect to Jupyter notebook on port 8888 inside the container
  1. Connect to Driverless AI with your browser:
http://Your-Driverless-AI-Host-Machine:12345

Stopping the Azure Instance

The Azure instance will continue to run even when you close the Azure portal. To stop the instance:

  1. Click the Virtual Machines left menu item.
  2. Select the checkbox beside your DriverlessAI virtual machine.
  3. On the right side of the row, click the … button, then select Stop. (Note that you can then restart this by selecting Start.)
../_images/azure_stop_vm.png

Upgrading the Driverless AI Image

WARNING: Experiments, MLIs, and MOJOs reside in the Driverless AI tmp directory and are not automatically upgraded when Driverless AI is upgraded.

  • Build MLI models before upgrading.
  • Build MOJO pipelines before upgrading.
  • Back up your Driverless AI tmp directory before upgrading.

If you did not build MLI on a model before upgrading Driverless AI, then you will not be able to view MLI on that model after upgrading. Before upgrading, be sure to run MLI jobs on models that you want to continue to interpret in future releases. If that MLI job appears in the list of Interpreted Models in your current version, then it will be retained after upgrading.

If you did not build a MOJO pipeline on a model before upgrading Driverless AI, then you will not be able to build a MOJO pipeline on that model after upgrading. Before upgrading, be sure to build MOJO pipelines on all desired models and then back up your Driverless AI tmp directory.

Upgrading from Version 1.2.2 or Earlier

It is not possible to upgrade from version 1.2.2 or earlier to the latest version. You have to manually remove the 1.2.2 container and then reinstall the latest Driverless AI version. Be sure to back up your data before doing this.

Upgrading from Version 1.3.0 or Later

Note: Stop Driverless AI if it is still running.

  1. SSH into the machine that is running Driverless AI.
  2. Set up a directory for the version of Driverless AI on the host machine:
# Set up directory with the version name
mkdir dai_rel_1.5.0

# cd into the new directory
cd dai_rel_1.5.0
  1. Retrieve the Driverless AI package from https://www.h2o.ai/download/ and add it to the new directory.
  2. Load the Driverless AI Docker image inside the new directory. For example:
# Load the Driverless AI docker image
docker load < dai-docker-centos7-x86_64-1.5.0-9.0.tar.gz
  1. Copy the data, log, license, and tmp directories from the previous Driverless AI directory to the new Driverless AI directory:
# Copy the data, log, license, and tmp directories on the host machine
cp -r dai_rel_1.3.0/data dai_rel_1.5.0/data
cp -r dai_rel_1.3.0/log dai_rel_1.5.0/log
cp -r dai_rel_1.3.0/license dai_rel_1.5.0/license
cp -r dai_rel_1.3.0/tmp dai_rel_1.5.0/tmp

At this point, your experiments from the previous versions will be visible inside the Docker container.

  1. Start the Driverless AI Docker image.
  2. Connect to Driverless AI with your browser at http://Your-Driverless-AI-Host-Machine:12345.