Connecting to your data - Amazon Glue DataBrew
Services or capabilities described in Amazon Web Services documentation might vary by Region. To see the differences applicable to the China Regions, see Getting Started with Amazon Web Services in China (PDF).

Connecting to your data

For more information on connecting to the following data sources, choose the section that applies to you.

  • Amazon Glue Data Catalog – You can use the Data Catalog to define references to data objects stored in the Amazon Cloud, including the following services:

    • Amazon Redshift

    • Aurora MySQL

    • Aurora PostgreSQL

    • Amazon RDS for MySQL

    • Amazon RDS for PostgreSQL

    DataBrew recognizes all Lake Formation permissions that have been applied to Data Catalog resources, so DataBrew users can only access these resources if they're authorized.

    To create a dataset, you specify a Data Catalog database name and a table name. DataBrew takes care of the other connection details.

  • Amazon Data Exchange – You can choose from hundreds of third-party data sources that are available in Amazon Data Exchange. By subscribing to these data sources, you always have the most up-to-date version of the data.

    To create a dataset, you specify the name of a Data Exchange data product that you're subscribed to or entitled to use.

  • JDBC driver connections – You can create a dataset by connecting DataBrew to a JDBC-compatible data source. DataBrew supports connecting to the following sources through JDBC:

    • Amazon Redshift

    • Microsoft SQL Server

    • MySQL

    • Oracle

    • PostgreSQL

    • Snowflake