Use Amazon MSK storage capacity alerts
On Amazon MSK provisioned clusters, you choose the cluster's primary storage capacity. If you
exhaust the storage capacity on a broker in your provisioned cluster, this can affect the
broker's ability to produce and consume data, leading to costly downtime. Amazon MSK offers CloudWatch
metrics to help you monitor your cluster's storage capacity. However, to make it easier
for you to detect and resolve storage capacity issues, Amazon MSK automatically sends you
dynamic cluster storage capacity alerts. The storage capacity alerts include
recommendations for short-term and long-term steps to manage your cluster's storage
capacity. From the Amazon MSK console, you can use the links in an alert to act on its recommendations.
There are two types of MSK storage capacity alerts: proactive and remedial.
- Proactive ("Action required") storage capacity alerts warn you about potential storage issues with your cluster. When a broker in an MSK cluster has used more than 60% or 80% of its disk storage capacity, you receive a proactive alert for the affected broker.
- Remedial ("Critical action required") storage capacity alerts require you to take remedial action to fix a critical cluster issue when one of the brokers in your MSK cluster has run out of disk storage capacity.
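The two alert types can be read as a simple mapping from a broker's disk usage to an alert label. The following is a minimal sketch of that mapping, not Amazon MSK's actual implementation: the function name `classify_disk_usage` and the constant `PROACTIVE_THRESHOLDS` are hypothetical, and treating "run out of disk storage capacity" as 100% usage is an assumption.

```python
from typing import Optional

# Thresholds taken from the description above; the names are hypothetical.
PROACTIVE_THRESHOLDS = (60.0, 80.0)


def classify_disk_usage(used_percent: float) -> Optional[str]:
    """Return the alert label that would apply to a broker, or None.

    Assumption: "run out of disk storage capacity" is modeled as 100% usage.
    """
    if not 0.0 <= used_percent <= 100.0:
        raise ValueError("used_percent must be between 0 and 100")
    if used_percent >= 100.0:
        return "Critical action required"  # remedial alert
    if used_percent > min(PROACTIVE_THRESHOLDS):
        return "Action required"  # proactive alert (over 60% or 80%)
    return None  # below every documented threshold


if __name__ == "__main__":
    for sample in (45.0, 72.5, 91.0, 100.0):
        print(sample, classify_disk_usage(sample))
```

A mapping like this could be useful if you want your own dashboards to use the same labels as the alerts, for example when reading disk usage from CloudWatch metrics.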
Amazon MSK automatically sends these alerts to the Amazon MSK console and the Amazon Health Dashboard.
Storage capacity alerts are enabled by default for all MSK provisioned clusters and can't be turned off. This feature is supported in all regions where MSK is available.
Monitor storage capacity alerts
You can check for storage capacity alerts in several ways:
- Go to the Amazon MSK console. Storage capacity alerts are displayed in the cluster alerts pane for 90 days. The alerts contain recommendations and single-click link actions to address disk storage capacity issues.
- Use the ListClusters, ListClustersV2, DescribeCluster, or DescribeClusterV2 APIs to view CustomerActionStatus and all the alerts for a cluster.
- Go to the Amazon Health Dashboard to view alerts from MSK and other Amazon services.
- Set up Amazon Health API and Amazon EventBridge to route alert notifications to third-party platforms such as Datadog, New Relic, and Slack.
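For the API option, the following is a minimal sketch of reading CustomerActionStatus with the AWS SDK for Python (boto3). It rests on some assumptions: that the field appears under `ClusterInfo` → `Provisioned` in a DescribeClusterV2 response and directly under `ClusterInfo` in a DescribeCluster response, and that AWS credentials and a Region are already configured. The helper names are hypothetical, so check the response shape against the API reference before relying on it.

```python
from typing import Any, Dict, Optional


def extract_customer_action_status(response: Dict[str, Any]) -> Optional[str]:
    """Pull CustomerActionStatus out of a DescribeCluster(V2)-style response.

    Assumption: V2 responses nest the field under ClusterInfo.Provisioned,
    while V1 responses place it directly under ClusterInfo.
    """
    info = response.get("ClusterInfo", {})
    provisioned = info.get("Provisioned", {})
    return provisioned.get("CustomerActionStatus") or info.get("CustomerActionStatus")


def fetch_customer_action_status(cluster_arn: str) -> Optional[str]:
    """Call DescribeClusterV2 for one cluster and return its status, if any."""
    import boto3  # imported here so the parsing helper works without the SDK

    client = boto3.client("kafka")
    response = client.describe_cluster_v2(ClusterArn=cluster_arn)
    return extract_customer_action_status(response)


if __name__ == "__main__":
    # Hand-written sample response for illustration only, not real API output.
    sample = {"ClusterInfo": {"Provisioned": {"CustomerActionStatus": "ACTION_RECOMMENDED"}}}
    print(extract_customer_action_status(sample))
```

Keeping the parsing step separate from the API call makes it possible to test the logic against saved responses without network access.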