Integration with Amazon Glue - Amazon Athena
Services or capabilities described in Amazon Web Services documentation might vary by Region. To see the differences applicable to the China Regions, see Getting Started with Amazon Web Services in China (PDF).

Integration with Amazon Glue

Amazon Glue is a fully managed ETL (extract, transform, and load) Amazon Web Service. One of its key abilities is to analyze and categorize data. You can use Amazon Glue crawlers to automatically infer database and table schema from your data in Amazon S3 and store the associated metadata in the Amazon Glue Data Catalog.

Athena uses the Amazon Glue Data Catalog to store and retrieve table metadata for the Amazon S3 data in your Amazon Web Services account. The table metadata lets the Athena query engine know how to find, read, and process the data that you want to query.

To create database and table schema in the Amazon Glue Data Catalog, you can run an Amazon Glue crawler from within Athena on a data source, or you can run Data Definition Language (DDL) queries directly in the Athena Query Editor. Then, using the database and table schema that you created, you can use Data Manipulation (DML) queries in Athena to query the data.

You can register an Amazon Glue Data Catalog from an account other than your own. After you configure the required IAM permissions for Amazon Glue, you can use Athena to run cross-account queries. For more information, see Cross-account access to Amazon Glue data catalogs.

For more information about the Amazon Glue Data Catalog, see Data Catalog and crawlers in Amazon Glue in the Amazon Glue Developer Guide.

Separate charges apply to Amazon Glue. For more information, see Amazon Glue pricing.