Improving Amazon Glue performance - Amazon Glue
Services or capabilities described in Amazon Web Services documentation might vary by Region. To see the differences applicable to the China Regions, see Getting Started with Amazon Web Services in China (PDF).

Improving Amazon Glue performance

Baseline strategy for performance tuning

In order to improve Amazon Glue performance, you may consider updating certain performance related Amazon Glue parameters. When preparing to tune parameters, use the following best practices:

  • Determine your performance goals before beginning to identify problems.

  • Use metrics to identify problems before attempting to change tuning parameters.

For the most consistent results when tuning a job, develop a baseline strategy for your tuning work.

Generally, performance tuning is performed in the following workflow:

  1. Determine performance goals.

  2. Measure metrics.

  3. Identify bottlenecks.

  4. Reduce the impact of the bottlenecks.

  5. Repeat steps 2-4 until you achieve the intended target.

Tuning strategies for your job type

Spark jobs–follow the guidance in Best practices for performance tuning Amazon Glue for Apache Spark jobs on Amazon Prescriptive Guidance.

Other jobs–you can tune Amazon Glue for Ray and Amazon Glue Python shell jobs by adapting strategies available in other runtime environments.