Snowflake¶

Driverless AI allows you to explore Snowflake data sources from within the Driverless AI application. This section provides instructions for configuring Driverless AI to work with Snowflake. This setup requires you to enable authentication. If you enable Snowflake connectors, those file systems will be available in the UI, but you will not be able to use those connectors without authentication.

Snowflake with Authentication Example¶

This example enables the Snowflake data connector with authentication by passing the account, user, and password variables. Replace TAG below with the image tag.

nvidia-docker run \
    --rm \
    --shm-size=256m \
    -e DRIVERLESS_AI_ENABLED_FILE_SYSTEMS="file,snow" \
    -e DRIVERLESS_AI_SNOWFLAKE_ACCOUNT = "<account_id>" \
    -e DRIVERLESS_AI_SNOWFLAKE_USER = "<username>" \
    -e DRIVERLESS_AI_SNOWFLAKE_PASSWORD = "<password>"\
    -u `id -u`:`id -g` \
    -p 12345:12345 \
    -v `pwd`/data:/data \
    -v `pwd`/log:/log \
    -v `pwd`/license:/license \
    -v `pwd`/tmp:/tmp \
    -v `pwd`/service_account_json.json:/service_account_json.json \
    h2oai/dai-centos7-x86_64:TAG

After the Snowflake connector is enabled, you can add datasets by selecting Snowflake from the Add Dataset (or Drag and Drop) drop-down menu.

Specify the following information to add your dataset.

Enter Output Filename: Specify the name of the file on your local system that you want to add to Driverless AI. Note that this can only be a CSV file (for example, myfile.csv).
Enter Database: Specify the name of the Snowflake database that you are querying.
Enter Warehouse: Specify the name of the Snowflake warehouse that you are querying.
Enter Schema: Specify the schema of the dataset that you are querying.
Enter Region: (Optional) Specify the region of the warehouse that you are querying. This can be found in the Snowflake-provided URL to access your database (as in <optional-deployment-name>.<region>.<cloud-provider>.snowflakecomputing.com).
Enter Role: (Optional) Specify your role as designated within Snowflake. See https://docs.snowflake.net/manuals/user-guide/security-access-control-overview.html for more information.
Enter File Formatting Params: (Optional) Specify any additional parameters for formatting your datasets. Available parameters are listed in https://docs.snowflake.net/manuals/sql-reference/sql/create-file-format.html#optional-parameters. (Note: Use only parameters for TYPE = CSV.) For example, if your dataset includes a text column that contains commas, you can specify a different delimiter using FIELD_DELIMITER='character'. Separate multiple parameters with spaces only. For example:

FIELD_DELIMITER='|' FIELD_OPTIONALLY_ENCLOSED_BY=""

Enter Snowflake Query: Specify the Snowflake query that you want to execute.
When you are finished, select the Click to Make Query button to add the dataset.