Enabling Data Connectors

Driverless AI provides various data connectors for external data sources. Data sources are exposed in the form of the file systems. Each file system is prefixed by a unique prefix. For example:

  • To reference data on S3, use s3://.
  • To reference data on HDFS, use the prefix hdfs://.
  • To reference data on Azure Blob Store, use https://<storage_name>.blob.core.windows.net.
  • To reference data on BlueData Datatap, use dtap://.
  • To reference data on Google BigQuery, make sure you know the Google BigQuery dataset and the table that you want to query. Use a standard SQL query to ingest data.
  • To reference data on Google Cloud Storage, use gs://
  • To reference data on kdb+, use the hostname and the port http://<kdb_server>:<port>
  • To reference data on Minio, use http://<endpoint_url>.
  • To reference data on Snowflake, use a standard SQL query to ingest data.

Refer to the following sections for more information: