Skip to main content
Version: v0.7.x

Import Google BigQuery data

This tutorial guides you through importing data from Google BigQuery into your H2O Drive workspace. Follow the steps below to connect to your datasource and import the data.

Prerequisites

Before you begin, you will need:

  • A Google BigQuery account
  • Your Google BiqQuery Project ID

Step 1: connect the Google project

Let's connect to your Google BigQuery project.

  1. On the H2O Drive home page, click Import.
  2. Select Google BigQuery from the dropdown list of sources.
  3. If you have not added a credentials profile already, click Add New Credentials.
  4. Enter the following details to connect to the Google Bigquery project.
    • Profile Name: A suitable name for your personal credentials profile. This is the name that will appear on the dropdown list (which you saw on the previous screen) when you are selecting the credentials profile you wish to use.
    • Project Name: The Google Bigquery project name.
    • Google Storage Bucket Location: The location path to the Google Storage Bucket in the format bucket_name/path_to_dataset_directory.
    • Google Cloud Service Account Key (JSON): By default, there is a service account key file created for every Google Cloud Project. Click Browse and upload the relevant project's service account key .json file here.
      note

      For more information on retrieving the exact project name, see List Projects in the Google BigQuery API documentation.

      import-bigquery-data
  5. Click Save.

Step 2: select the data tables

  1. Select the credentials profile that you just created from the dropdown list.
  2. Enter the following details to indicate the specific table within the data that you want to import.
    • Table ID: The table ID of the specific table you want to import.
    • Save Dataset As: A name for the dataset table that you are importing. This name will be used as the display name for the dataset on the H2O Drive UI. select-data-table
  3. Click Next. You have successfully imported a dataset! You should now be able to see it displayed on H2O Drive under the specified filename. imported-dataset

Step 3: share the dataset

  1. Select the imported dataset by clicking on the filename.
  2. Click Get Link to get a pre-signed link that you can share with other users or applications that need to access this dataset.
  3. Set the expiration time and click Get Link. set-expiry-time
  4. Copy the link that appears and click Close. You can now use the copied link to share this dataset.

You can import this dataset using the pre-signed link to H2O-3, Driverless AI, or share it with someone else who can then import it onto their H2O Drive instance using the HTTP download option.


Feedback