Monitoring Amazon Glue Spark jobs - Amazon Glue
Services or capabilities described in Amazon Web Services documentation might vary by Region. To see the differences applicable to the China Regions, see Getting Started with Amazon Web Services in China (PDF).

Monitoring Amazon Glue Spark jobs

Spark Metrics available in Amazon Glue Studio

The Metrics tab shows metrics collected when a job runs and profiling is turned on. The following graphs are shown in Spark jobs:

  • ETL Data Movement

  • Memory Profile: Driver and Executors

Choose View additional metrics to show the following graphs:

  • ETL Data Movement

  • Memory Profile: Driver and Executors

  • Data Shuffle Across Executors

  • CPU Load: Driver and Executors

  • Job Execution: Active Executors, Completed Stages & Maximum Needed Executors

Data for these graphs is pushed to CloudWatch metrics if the job is configured to collect metrics. For more information about how to turn on metrics and interpret the graphs, see Job monitoring and debugging.

Example ETL data movement graph

The ETL Data Movement graph shows the following metrics:


                The graph for ETL Data Movement in the Metrics tab of the Amazon Glue console.
Example Memory profile graph

The Memory Profile graph shows the following metrics:


                The graph for Memory Profile in the Metrics tab of the Amazon Glue console.
Example Data shuffle across executors graph

The Data Shuffle Across Executors graph shows the following metrics:


                The graph for Data Shuffle Across Executors in the Metrics tab of the Amazon Glue console.
Example CPU load graph

The CPU Load graph shows the following metrics:


                The graph for CPU Load in the Metrics tab of the Amazon Glue console.
Example Job execution graph

The Job Execution graph shows the following metrics:


                The graph for Job Execution in the Metrics tab of the Amazon Glue console.