Improving Amazon Glue performance
Baseline strategy for performance tuning
To improve Amazon Glue performance, consider updating performance-related Amazon Glue parameters. When preparing to tune parameters, use the following best practices:
Determine your performance goals before beginning to identify problems.
Use metrics to identify problems before attempting to change tuning parameters.
For the most consistent results when tuning a job, develop a baseline strategy for your tuning work.
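As a minimal sketch of what a baseline might look like, the following example summarizes the durations of several runs of the same job with unchanged parameters and compares the result to a performance goal. The function names, the sample durations, and the goal values are all hypothetical; in practice the durations could come from whatever run metric you track, such as the execution time reported for each job run.

```python
from statistics import median, pstdev


def summarize_baseline(durations_s):
    """Summarize repeated run durations (in seconds) as a median and a relative spread."""
    if not durations_s:
        raise ValueError("need at least one run to build a baseline")
    mid = median(durations_s)
    # Population standard deviation divided by the median: a rough
    # indicator of how noisy the runs are relative to their typical length.
    spread = pstdev(durations_s) / mid if mid else 0.0
    return {"median_s": mid, "relative_spread": spread}


def meets_goal(baseline, goal_s):
    """Return True when the baseline median is within the target duration."""
    return baseline["median_s"] <= goal_s


# Hypothetical durations from five runs of one job, parameters unchanged.
runs = [620, 600, 640, 610, 630]
baseline = summarize_baseline(runs)
print(baseline["median_s"], meets_goal(baseline, goal_s=600))
```

Recording the spread alongside the median helps you judge whether a later change in duration reflects a parameter change or only run-to-run variation.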
Generally, performance tuning follows this workflow:
1. Determine performance goals.
2. Measure metrics.
3. Identify bottlenecks.
4. Reduce the impact of the bottlenecks.
5. Repeat steps 2-4 until you achieve the intended target.
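The workflow above can be sketched as a simple loop. This is an illustration only, not an Amazon Glue API: `tune`, `measure`, `find_bottleneck`, and `reduce_bottleneck` are hypothetical names, and the stage timings are simulated values standing in for real job metrics.

```python
def tune(goal_s, measure, find_bottleneck, reduce_bottleneck, max_rounds=5):
    """Repeat measure -> identify -> reduce until the goal is met or rounds run out."""
    history = []
    for _ in range(max_rounds):
        metrics = measure()                      # step 2: measure metrics
        history.append(metrics)
        if metrics["duration_s"] <= goal_s:      # goal from step 1 reached
            break
        bottleneck = find_bottleneck(metrics)    # step 3: identify the bottleneck
        reduce_bottleneck(bottleneck)            # step 4: reduce its impact
    return history


# Simulated per-stage timings (seconds) standing in for real job metrics.
stages = {"read": 300, "shuffle": 500, "write": 100}


def measure():
    return {"duration_s": sum(stages.values()), "stages": dict(stages)}


def find_bottleneck(metrics):
    # Treat the slowest stage as the bottleneck.
    return max(metrics["stages"], key=metrics["stages"].get)


def reduce_bottleneck(stage):
    # Pretend that a tuning change halves the time spent in that stage.
    stages[stage] //= 2


history = tune(600, measure, find_bottleneck, reduce_bottleneck)
print([m["duration_s"] for m in history])
```

Keeping the history of measurements makes it possible to confirm that each change moved the job toward the goal, and the `max_rounds` cap reflects that tuning usually stops once further changes no longer pay off.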
Tuning strategies for your job type
Spark jobs: Follow the guidance in Best practices for performance tuning Amazon Glue for Apache Spark jobs on Amazon Prescriptive Guidance.
Other jobs: You can tune Amazon Glue for Ray and Amazon Glue Python shell jobs by adapting strategies available in other runtime environments.