Connecting to data
An Amazon Glue connection is a Data Catalog object that stores login credentials, URI strings, virtual private cloud (VPC) information, and more for a particular data store. Amazon Glue crawlers, jobs, and development endpoints use connections in order to access certain types of data stores. You can use connections for both sources and targets, and reuse the same connection across multiple crawler or extract, transform, and load (ETL) jobs.
Amazon Glue supports the following connection types:
-
Amazon DocumentDB
-
Amazon OpenSearch Service, for use with Amazon Glue for Spark.
-
Amazon Redshift
-
Azure Cosmos, for use of Azure Cosmos DB for NoSQL with Amazon Glue ETL jobs
-
Azure SQL, for use with Amazon Glue for Spark.
-
Google BigQuery, for use with Amazon Glue for Spark.
-
JDBC
-
Kafka
-
MongoDB
-
MongoDB Atlas
-
Salesforce
-
SAP HANA, for use with Amazon Glue for Spark.
-
Snowflake, for use with Amazon Glue for Spark.
-
Teradata Vantage, when using Amazon Glue for Spark.
-
Vertica, for use with Amazon Glue for Spark.
-
Various Amazon Relational Database Service (Amazon RDS) offerings.
-
Network (designates a connection to a data source that is in an Amazon Virtual Private Cloud (Amazon VPC))
-
Aurora (supported if the native JDBC driver is being used. Not all driver features can be leveraged)
With Amazon Glue Studio, you can also create a connection for a connector. A connector is an optional code package that assists with accessing data stores in Amazon Glue Studio. For more information, see Using connectors and connections with Amazon Glue Studio
For information about how to connect to on-premises databases, see How to access and analyze on-premises data stores using
Amazon Glue
This section includes the following topics to help you use Amazon Glue connections:
- Amazon Glue connection properties
- Storing connection credentials in Amazon Secrets Manager
- Adding an Amazon Glue connection
- Testing an Amazon Glue connection
- Configuring Amazon calls to go through your VPC
- Connecting to a JDBC data store in a VPC
- Using a MongoDB or MongoDB Atlas connection
- Crawling an Amazon S3 data store using a VPC endpoint
- Troubleshooting connection issues in Amazon Glue
- Tutorial: Using the Amazon Glue Connector for Elasticsearch