Skip to main content
Version: v0.7.x

Import Amazon Redshift data

This tutorial guides you through importing a database from Amazon Redshift into your H2O Drive workspace. Follow the steps below to connect to your datasource and import the data.

Prerequisites

Before you begin, you will need:

  • Access to an AWS account
  • Access credentials to your Amazon Redshift data warehouse

Step 1: connect the data warehouse

Let's connect to your Amazon Redshift data warehouse.

  1. On the H2O Drive home page, click Import.
  2. Select Amazon Redshift from the dropdown list of sources.
  3. If you have not added a credentials profile already, click Add New Credentials.
  4. Enter the following details to connect to the Redshift database.
    • Profile Name: A suitable name for your personal credentials profile. This is the name that will appear on the dropdown list (which you saw on the previous screen) when you are selecting the credentials profile you wish to use.
    • User: The username used to access the database.
    • Password: The password credential used to access the database.
    • Database: The name of the database.
    • Port: The port number on which the database instance is running.
    • Server: The connection URL to the database.
      note

      For more information, see Finding your cluster connection string in the Amazon Redshift documentation.

      import-redshift-data
  5. Click Save.
  6. Select the credentials profile that you just created from the dropdown list and click Next. select-credentials

Step 2: Select the data tables

  1. Select the data table that you wish to import. Downloading the dataset may take a few moments. select-data-table
  2. Click Next.
  3. Once the data table has been imported successfully, enter a filename for the table so that you will be able to easily identify it on your H2O Drive workspace later. enter-filename
  4. Click Save. You have successfully imported a dataset! You should now be able to see it displayed on H2O Drive under the specified filename. imported-dataset

Step 3: Share the dataset

  1. Select the imported dataset by clicking on the filename.
  2. Click Get Link to get a pre-signed link that you can share with other users or applications that need to access this dataset.
  3. Set the expiration time and click Get Link. set-expiry-time
  4. Copy the link that appears and click Close. You can now use the copied link to share this dataset.

You can import this dataset using the pre-signed link to H2O-3, Driverless AI, or share it with someone else who can then import it onto their H2O Drive instance using the HTTP download option.


Feedback