Monitoring Data Catalog usage metrics in Amazon CloudWatch
Amazon Glue Data Catalog usage metrics are now available with Amazon CloudWatch, simplifying the monitoring and understanding of resource utilization in your Data Catalog. You now have immediate visibility into your Glue Catalog API usage of catalogs, databases, tables, partitions, and connections, making it easier to maintain oversight of your Data Catalog.
Overview of Data Catalog metrics
Amazon Glue Data Catalog automatically publishes usage metrics to Amazon CloudWatch. With CloudWatch metrics integration, you can track critical performance indicators every minute, including:
-
Table requests
-
Partition indexes created
-
Connections updated
-
Statistics updated
These metrics help you identify bottlenecks, detect anomalies, and make data-driven decisions to improve overall data catalog reliability. You can also set up CloudWatch alarms to receive notifications when metrics exceed specified thresholds, allowing for proactive management of your deployment.
Adding metrics to your CloudWatch dashboard
You can create custom dashboards to monitor your Amazon Glue Data Catalog resources and set up alarms to be notified of any unusual activity.
You can add Data Catalog metrics to your CloudWatch dashboard by following these steps:
-
Open the CloudWatch console at https://console.amazonaws.cn/cloudwatch/
. -
In the navigation pane, choose Metrics.
-
Choose All metrics.
-
Choose Usage>By Amazon resource.
-
Filter by Glue to see available metrics.
-
Select the metrics you want to add to your dashboard.
-
Add metrics for catalogs, databases, tables, partitions, and connections to your CloudWatch graph.
You can configure custom alarms that trigger automatically when API usage exceeds your defined thresholds to identify abnormalities in your data catalog usage.
For detailed instructions on setting up alarms, see Creating a Metrics Insights CloudWatch alarm.