Version: v0.7.x

Import Azure Blob Storage data

This tutorial guides you through importing blobs from Azure Blob Storage into your H2O Drive workspace. Follow the steps below to connect to your data source and import the data.

Prerequisites

Before you begin, you will need:

  • Access to an Azure account
  • A signed URL to the Azure Blob Storage resource

Step 1: connect the Azure Blob resource

Let's connect to your Azure Blob resource.

  1. On the H2O Drive home page, click Import.
  2. Select Azure Blob Storage from the dropdown list of sources.
  3. If you have not added a credentials profile already, click Add New Credentials.
  4. Enter the following details to connect to the Azure Blob resource.
    • Profile Name: A suitable name for your personal credentials profile. This name appears in the dropdown list on the previous screen when you select the credentials profile to use.
    • Storage Account Name: The name of the Azure Blob Storage account.
      • Is Public: Select this checkbox if the Azure Blob Storage account is public.
    • Access Key: The access key associated with your Azure Storage account.
    • Connection String: The Azure Storage connection string that includes authorization information required for H2O Drive to access data in an Azure Storage account.
      Note: For more information, see Authenticate with the account access key and Configure connection string in the Azure Blob Storage Documentation.
  5. Click Save.
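
For illustration, an Azure Storage connection string is a semicolon-separated list of key=value fields. The Python sketch below is not part of H2O Drive; it only splits such a string so you can check that the account name in it matches the Storage Account Name you entered. The function name `parse_connection_string` and the sample values are hypothetical placeholders.

```python
def parse_connection_string(conn_str: str) -> dict[str, str]:
    """Split an Azure Storage connection string into its key/value fields."""
    fields = {}
    for part in conn_str.split(";"):
        part = part.strip()
        if not part:
            continue  # tolerate a trailing semicolon
        # Split on the first "=" only: base64 account keys can end in "=" padding.
        key, sep, value = part.partition("=")
        if not sep:
            raise ValueError(f"malformed segment: {part!r}")
        fields[key] = value
    return fields


# Placeholder values for illustration only, not real credentials.
example = (
    "DefaultEndpointsProtocol=https;"
    "AccountName=examplestorage;"
    "AccountKey=ZmFrZS1rZXk=;"
    "EndpointSuffix=core.windows.net"
)
fields = parse_connection_string(example)
print(fields["AccountName"])
```

Checking the string locally before pasting it into the form can catch a truncated key or a missing field early.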

Step 2: select the Azure blob

  1. Select the credentials profile that you just created from the dropdown list.
  2. Enter the details that identify the specific blob you want to import.
  3. Click Next. You have successfully imported a dataset! It now appears in H2O Drive under the specified filename.

Step 3: share the dataset

  1. Select the imported dataset by clicking on the filename.
  2. Click Get Link to get a pre-signed link that you can share with other users or applications that need to access this dataset.
  3. Set the expiration time and click Get Link.
  4. Copy the link that appears and click Close. You can now use the copied link to share this dataset.

You can import this dataset into H2O-3 or Driverless AI using the pre-signed link, or share the link with someone else, who can then import the dataset into their own H2O Drive instance using the HTTP download option.
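
As a rough sketch of how an application might consume the pre-signed link, the Python snippet below streams the linked file to disk using only the standard library. It assumes the link can be fetched with a plain HTTP GET and no extra headers, which this tutorial does not confirm. The function name `download_from_link`, the URL, and the file name are hypothetical.

```python
import shutil
import urllib.request


def download_from_link(url: str, dest_path: str) -> int:
    """Stream the content behind `url` into `dest_path`; return bytes written."""
    with urllib.request.urlopen(url) as response, open(dest_path, "wb") as out:
        # Copy in chunks so large datasets are not loaded into memory at once.
        shutil.copyfileobj(response, out)
        return out.tell()


# Example call (hypothetical link; replace with the one copied from H2O Drive):
# download_from_link("https://drive.example.com/presigned/...", "dataset.csv")
```

The link stops working once the expiration time you set has passed, so a failed download from an older link may simply mean a new link is needed.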

