Generating partition indexes - Amazon Glue
Services or capabilities described in Amazon Web Services documentation might vary by Region. To see the differences applicable to the China Regions, see Getting Started with Amazon Web Services in China (PDF).

Generating partition indexes

The Data Catalog supports creating partition indexes to provide efficient lookup for specific partitions. For more information, see Creating partition indexes. The Amazon Glue crawler creates partition indexes for Amazon S3 and Delta Lake targets by default.

Amazon Web Services Management Console
  1. Sign in to the Amazon Web Services Management Console and open the Amazon Glue console at https://console.amazonaws.cn/glue/.

  2. Choose Crawlers under the Data Catalog.

  3. When you define a crawler, the option to Create partition indexes automatically is enabled by default under Advanced options on the Set output and scheduling page.

    To disable this option, you can unselect the checkbox Create partition indexes automatically in the console.

  4. Complete the crawler configuration and choose Create crawler.

Amazon CLI

You can also disable this option by using the Amazon CLI, set the CreatePartitionIndex in the configuration parameter. The default value is true.

aws glue update-crawler \ --name myCrawler \ --configuration '{"Version": 1.0, "CreatePartitionIndex": false }'

Usage notes for partition indexes

  • Tables created by the crawler do not have the variable partition_filtering.enabled by default. For more information, see Amazon Glue partition indexing and filtering.

  • Creating partition indexes for encrypted partitions is not supported.