Logs and metrics supported by Amazon CloudWatch Application Insights

The following lists show the supported logs and metrics for Amazon CloudWatch Application Insights.

CloudWatch Application Insights supports the following logs:

  • Microsoft Internet Information Services (IIS) logs

  • Error log for SQL Server on EC2

  • Custom .NET application logs, such as Log4Net

  • Windows Event logs, including Windows logs (System, Application, and Security) and Applications and Services logs

  • Amazon CloudWatch Logs for Amazon Lambda

  • Error log and slow log for RDS MySQL, Aurora MySQL, and MySQL on EC2

  • PostgreSQL log for RDS PostgreSQL and PostgreSQL on EC2

  • Amazon CloudWatch Logs for Amazon Step Functions

  • Execution logs and access logs (JSON, CSV, and XML, but not CLF) for API Gateway REST API stages

  • Prometheus JMX exporter logs (EMF)

  • Alert logs and listener logs for Oracle on Amazon RDS and Oracle on Amazon EC2

  • Container logs routed from Amazon ECS containers to CloudWatch using the awslogs log driver

  • Container logs routed from Amazon ECS containers to CloudWatch using the FireLens container log router

  • Container logs routed from Amazon EKS or Kubernetes running on Amazon EC2 to CloudWatch using the Fluent Bit or Fluentd log processor with Container Insights

Amazon Elastic Compute Cloud (EC2)

CloudWatch built-in metrics

CPUCreditBalance

CPUCreditUsage

CPUSurplusCreditBalance

CPUSurplusCreditsCharged

CPUUtilization

DiskReadBytes

DiskReadOps

DiskWriteBytes

DiskWriteOps

EBSByteBalance%

EBSIOBalance%

EBSReadBytes

EBSReadOps

EBSWriteBytes

EBSWriteOps

NetworkIn

NetworkOut

NetworkPacketsIn

NetworkPacketsOut

StatusCheckFailed

StatusCheckFailed_Instance

StatusCheckFailed_System

CloudWatch agent metrics (Windows server)

.NET CLR Exceptions # of Exceps Thrown

.NET CLR Exceptions # of Exceps Thrown/Sec

.NET CLR Exceptions # of Filters/sec

.NET CLR Exceptions # of Finallys/sec

.NET CLR Exceptions Throw to Catch Depth/sec

.NET CLR Interop # of CCWs

.NET CLR Interop # of Stubs

.NET CLR Interop # of TLB exports/sec

.NET CLR Interop # of TLB imports/sec

.NET CLR Interop # of marshaling

.NET CLR Jit % Time in Jit

.NET CLR Jit Standard Jit Failures

.NET CLR Loading % Time Loading

.NET CLR Loading Rate of Load Failures

.NET CLR LocksAndThreads Contention Rate/sec

.NET CLR LocksAndThreads Queue Length/sec

.NET CLR Memory # Total Committed Bytes

.NET CLR Memory % Time in GC

.NET CLR Networking 4.0.0.0 HttpWebRequest Average Queue Time

.NET CLR Networking 4.0.0.0 HttpWebRequests Aborted/sec

.NET CLR Networking 4.0.0.0 HttpWebRequests Failed/sec

.NET CLR Networking 4.0.0.0 HttpWebRequests Queued/sec

APP_POOL_WAS Total Worker Process Ping Failures

ASP.NET Application Restarts

ASP.NET Applications % Managed Processor Time (estimated)

ASP.NET Applications Errors Total/Sec

ASP.NET Applications Errors Unhandled During Execution/sec

ASP.NET Applications Requests in Application Queue

ASP.NET Applications Requests/Sec

ASP.NET Request Wait Time

ASP.NET Requests Queued

HTTP Service Request Queues CurrentQueueSize

LogicalDisk % Free Space

Memory % Committed Bytes In Use

Memory Available Mbytes

Memory Pages/sec

Network Interface Bytes Total/sec

Paging File % Usage

PhysicalDisk % Disk Time

PhysicalDisk Avg. Disk Queue Length

PhysicalDisk Avg. Disk sec/Read

PhysicalDisk Avg. Disk sec/Write

PhysicalDisk Disk Read Bytes/sec

PhysicalDisk Disk Reads/sec

PhysicalDisk Disk Write Bytes/sec

PhysicalDisk Disk Writes/sec

Processor % Idle Time

Processor % Interrupt Time

Processor % Processor Time

Processor % User Time

SQLServer:Access Methods Forwarded Records/sec

SQLServer:Access Methods Full Scans/sec

SQLServer:Access Methods Page Splits/sec

SQLServer:Buffer Manager Buffer cache hit ratio

SQLServer:Buffer Manager Page life expectancy

SQLServer:General Statistics Processes blocked

SQLServer:General Statistics User Connections

SQLServer:Latches Average Latch Wait Time (ms)

SQLServer:Locks Average Wait Time (ms)

SQLServer:Locks Lock Timeouts/sec

SQLServer:Locks Lock Waits/sec

SQLServer:Locks Number of Deadlocks/sec

SQLServer:Memory Manager Memory Grants Pending

SQLServer:SQL Statistics Batch Requests/sec

SQLServer:SQL Statistics SQL Compilations/sec

SQLServer:SQL Statistics SQL Re-Compilations/sec

System Processor Queue Length

TCPv4 Connections Established

TCPv6 Connections Established

W3SVC_W3WP File Cache Flushes

W3SVC_W3WP File Cache Misses

W3SVC_W3WP Requests/Sec

W3SVC_W3WP URI Cache Flushes

W3SVC_W3WP URI Cache Misses

Web Service Bytes Received/Sec

Web Service Bytes Sent/Sec

Web Service Connection attempts/sec

Web Service Current Connections

Web Service Get Requests/sec

Web Service Post Requests/sec

Bytes Received/sec

Normal Messages Queue Length/sec

Urgent Message Queue Length/sec

Reconnect Count

Unacknowledged Message Queue Length/sec

Messages Outstanding

Messages Sent/sec

Database Update Messages/sec

Update Messages/sec

Flushes/sec

Crypto Checkpoints Saved/sec

Crypto Checkpoints Restored/sec

Registry Checkpoints Restored/sec

Registry Checkpoints Saved/sec

Cluster API Calls/sec

Resource API Calls/sec

Cluster Handles/sec

Resource Handles/sec

CloudWatch agent metrics (Linux server)

cpu_time_active

cpu_time_guest

cpu_time_guest_nice

cpu_time_idle

cpu_time_iowait

cpu_time_irq

cpu_time_nice

cpu_time_softirq

cpu_time_steal

cpu_time_system

cpu_time_user

cpu_usage_active

cpu_usage_guest

cpu_usage_guest_nice

cpu_usage_idle

cpu_usage_iowait

cpu_usage_irq

cpu_usage_nice

cpu_usage_softirq

cpu_usage_steal

cpu_usage_system

cpu_usage_user

disk_free

disk_inodes_free

disk_inodes_used

disk_used

disk_used_percent

diskio_io_time

diskio_iops_in_progress

diskio_read_bytes

diskio_read_time

diskio_reads

diskio_write_bytes

diskio_write_time

diskio_writes

mem_active

mem_available

mem_available_percent

mem_buffered

mem_cached

mem_free

mem_inactive

mem_used

mem_used_percent

net_bytes_recv

net_bytes_sent

net_drop_in

net_drop_out

net_err_in

net_err_out

net_packets_recv

net_packets_sent

netstat_tcp_close

netstat_tcp_close_wait

netstat_tcp_closing

netstat_tcp_established

netstat_tcp_fin_wait1

netstat_tcp_fin_wait2

netstat_tcp_last_ack

netstat_tcp_listen

netstat_tcp_none

netstat_tcp_syn_recv

netstat_tcp_syn_sent

netstat_tcp_time_wait

netstat_udp_socket

processes_blocked

processes_dead

processes_idle

processes_paging

processes_running

processes_sleeping

processes_stopped

processes_total

processes_total_threads

processes_wait

processes_zombies

swap_free

swap_used

swap_used_percent

Elastic Block Store (EBS)

VolumeReadBytes

VolumeWriteBytes

VolumeReadOps

VolumeWriteOps

VolumeTotalReadTime

VolumeTotalWriteTime

VolumeIdleTime

VolumeQueueLength

VolumeThroughputPercentage

VolumeConsumedReadWriteOps

BurstBalance

Elastic Load Balancer (ELB)

EstimatedALBActiveConnectionCount

EstimatedALBConsumedLCUs

EstimatedALBNewConnectionCount

EstimatedProcessedBytes

HTTPCode_Backend_4XX

HTTPCode_Backend_5XX

HealthyHostCount

RequestCount

UnHealthyHostCount

Application ELB

EstimatedALBActiveConnectionCount

EstimatedALBConsumedLCUs

EstimatedALBNewConnectionCount

EstimatedProcessedBytes

HTTPCode_Backend_4XX

HTTPCode_Backend_5XX

HealthyHostCount

Latency

RequestCount

SurgeQueueLength

UnHealthyHostCount

Amazon EC2 Auto Scaling groups

CPUCreditBalance

CPUCreditUsage

CPUSurplusCreditBalance

CPUSurplusCreditsCharged

CPUUtilization

DiskReadBytes

DiskReadOps

DiskWriteBytes

DiskWriteOps

EBSByteBalance%

EBSIOBalance%

EBSReadBytes

EBSReadOps

EBSWriteBytes

EBSWriteOps

NetworkIn

NetworkOut

NetworkPacketsIn

NetworkPacketsOut

StatusCheckFailed

StatusCheckFailed_Instance

StatusCheckFailed_System

Amazon Simple Queue Service (SQS)

ApproximateAgeOfOldestMessage

ApproximateNumberOfMessagesDelayed

ApproximateNumberOfMessagesNotVisible

ApproximateNumberOfMessagesVisible

NumberOfEmptyReceives

NumberOfMessagesDeleted

NumberOfMessagesReceived

NumberOfMessagesSent

Amazon Relational Database Service (RDS)

RDS Database instances

BurstBalance

CPUCreditBalance

CPUUtilization

DatabaseConnections

DiskQueueDepth

FailedSQLServerAgentJobsCount

FreeStorageSpace

FreeableMemory

NetworkReceiveThroughput

NetworkTransmitThroughput

ReadIOPS

ReadLatency

ReadThroughput

WriteIOPS

WriteLatency

WriteThroughput

RDS Database clusters

ActiveTransactions

AuroraBinlogReplicaLag

AuroraReplicaLag

BackupRetentionPeriodStorageUsed

BinLogDiskUsage

BlockedTransactions

BufferCacheHitRatio

CPUUtilization

CommitLatency

CommitThroughput

DDLLatency

DDLThroughput

DMLLatency

DMLThroughput

DatabaseConnections

Deadlocks

DeleteLatency

DeleteThroughput

EngineUptime

FreeLocalStorage

FreeableMemory

InsertLatency

InsertThroughput

LoginFailures

NetworkReceiveThroughput

NetworkThroughput

NetworkTransmitThroughput

Queries

ResultSetCacheHitRatio

SelectLatency

SelectThroughput

SnapshotStorageUsed

TotalBackupStorageBilled

UpdateLatency

UpdateThroughput

VolumeBytesUsed

VolumeReadIOPs

VolumeWriteIOPs

Amazon Lambda function

Errors

DeadLetterErrors

Duration

Throttles

IteratorAge

ProvisionedConcurrencySpilloverInvocations

Amazon DynamoDB table

SystemErrors

UserErrors

ConsumedReadCapacityUnits

ConsumedWriteCapacityUnits

ReadThrottleEvents

WriteThrottleEvents

TimeToLiveDeletedItemCount

ConditionalCheckFailedRequests

TransactionConflict

ReturnedRecordsCount

PendingReplicationCount

ReplicationLatency

Amazon S3 bucket

ReplicationLatency

BytesPendingReplication

OperationsPendingReplication

4xxErrors

5xxErrors

AllRequests

GetRequests

PutRequests

DeleteRequests

HeadRequests

PostRequests

SelectRequests

ListRequests

SelectScannedBytes

SelectReturnedBytes

FirstByteLatency

TotalRequestLatency

BytesDownloaded

BytesUploaded

Amazon Step Functions

Execution-level

ExecutionTime

ExecutionThrottled

ExecutionsFailed

ExecutionsTimedOut

ExecutionsAborted

ExecutionsSucceeded

ExecutionsStarted

Activity

ActivityRunTime

ActivityScheduleTime

ActivityTime

ActivitiesFailed

ActivitiesHeartbeatTimedOut

ActivitiesTimedOut

ActivitiesScheduled

ActivitiesSucceeded

ActivitiesStarted

Lambda function

LambdaFunctionRunTime

LambdaFunctionScheduleTime

LambdaFunctionTime

LambdaFunctionsFailed

LambdaFunctionsTimedOut

LambdaFunctionsScheduled

LambdaFunctionsSucceeded

LambdaFunctionsStarted

Service integration

ServiceIntegrationRunTime

ServiceIntegrationScheduleTime

ServiceIntegrationTime

ServiceIntegrationsFailed

ServiceIntegrationsTimedOut

ServiceIntegrationsScheduled

ServiceIntegrationsSucceeded

ServiceIntegrationsStarted

Step Functions API

ThrottledEvents

ProvisionedBucketSize

ProvisionedRefillRate

ConsumedCapacity

API Gateway REST API stages

4XXError

5XXError

IntegrationLatency

Latency

DataProcessed

CacheHitCount

CacheMissCount

Java

java_lang_memory_heapmemoryusage_used

java_lang_memory_heapmemoryusage_committed

java_lang_operatingsystem_openfiledescriptorcount

java_lang_operatingsystem_maxfiledescriptorcount

java_lang_operatingsystem_freephysicalmemorysize

java_lang_operatingsystem_freeswapspacesize

java_lang_threading_threadcount

java_lang_threading_daemonthreadcount

java_lang_classloading_loadedclasscount

java_lang_garbagecollector_collectiontime_copy

java_lang_garbagecollector_collectiontime_ps_scavenge

java_lang_garbagecollector_collectiontime_parnew

java_lang_garbagecollector_collectiontime_marksweepcompact

java_lang_garbagecollector_collectiontime_ps_marksweep

java_lang_garbagecollector_collectiontime_concurrentmarksweep

java_lang_garbagecollector_collectiontime_g1_young_generation

java_lang_garbagecollector_collectiontime_g1_old_generation

java_lang_garbagecollector_collectiontime_g1_mixed_generation

java_lang_operatingsystem_committedvirtualmemorysize

Amazon Elastic Container Service (Amazon ECS)

CloudWatch built-in metrics

CPUReservation

CPUUtilization

MemoryReservation

MemoryUtilization

GPUReservation

Container Insights metrics

ContainerInstanceCount

CpuUtilized

CpuReserved

DeploymentCount

DesiredTaskCount

MemoryUtilized

MemoryReserved

NetworkRxBytes

NetworkTxBytes

PendingTaskCount

RunningTaskCount

ServiceCount

StorageReadBytes

StorageWriteBytes

TaskCount

TaskSetCount

instance_cpu_limit

instance_cpu_reserved_capacity

instance_cpu_usage_total

instance_cpu_utilization

instance_filesystem_utilization

instance_memory_limit

instance_memory_reserved_capacity

instance_memory_utilization

instance_memory_working_set

instance_network_total_bytes

instance_number_of_running_tasks

Container Insights Prometheus metrics

Java JMX metrics

java_lang_memory_heapmemoryusage_used

java_lang_memory_heapmemoryusage_committed

java_lang_operatingsystem_openfiledescriptorcount

java_lang_operatingsystem_maxfiledescriptorcount

java_lang_operatingsystem_freephysicalmemorysize

java_lang_operatingsystem_freeswapspacesize

java_lang_threading_threadcount

java_lang_classloading_loadedclasscount

java_lang_threading_daemonthreadcount

java_lang_garbagecollector_collectiontime_copy

java_lang_garbagecollector_collectiontime_ps_scavenge

java_lang_garbagecollector_collectiontime_parnew

java_lang_garbagecollector_collectiontime_marksweepcompact

java_lang_garbagecollector_collectiontime_ps_marksweep

java_lang_garbagecollector_collectiontime_concurrentmarksweep

java_lang_garbagecollector_collectiontime_g1_young_generation

java_lang_garbagecollector_collectiontime_g1_old_generation

java_lang_garbagecollector_collectiontime_g1_mixed_generation

java_lang_operatingsystem_committedvirtualmemorysize

Kubernetes on Amazon

Container Insights metrics

cluster_failed_node_count

cluster_node_count

namespace_number_of_running_pods

node_cpu_limit

node_cpu_reserved_capacity

node_cpu_usage_total

node_cpu_utilization

node_filesystem_utilization

node_memory_limit

node_memory_reserved_capacity

node_memory_utilization

node_memory_working_set

node_network_total_bytes

node_number_of_running_containers

node_number_of_running_pods

pod_cpu_reserved_capacity

pod_cpu_utilization

pod_cpu_utilization_over_pod_limit

pod_memory_reserved_capacity

pod_memory_utilization

pod_memory_utilization_over_pod_limit

pod_network_rx_bytes

pod_network_tx_bytes

service_number_of_running_pods

Container Insights Prometheus metrics

Java JMX metrics

java_lang_memory_heapmemoryusage_used

java_lang_memory_heapmemoryusage_committed

java_lang_operatingsystem_openfiledescriptorcount

java_lang_operatingsystem_maxfiledescriptorcount

java_lang_operatingsystem_freephysicalmemorysize

java_lang_operatingsystem_freeswapspacesize

java_lang_threading_threadcount

java_lang_classloading_loadedclasscount

java_lang_threading_daemonthreadcount

java_lang_garbagecollector_collectiontime_copy

java_lang_garbagecollector_collectiontime_ps_scavenge

java_lang_garbagecollector_collectiontime_parnew

java_lang_garbagecollector_collectiontime_marksweepcompact

java_lang_garbagecollector_collectiontime_ps_marksweep

java_lang_garbagecollector_collectiontime_concurrentmarksweep

java_lang_garbagecollector_collectiontime_g1_young_generation

java_lang_garbagecollector_collectiontime_g1_old_generation

java_lang_garbagecollector_collectiontime_g1_mixed_generation

java_lang_operatingsystem_committedvirtualmemorysize

Amazon FSx

DataReadBytes

DataWriteBytes

DataReadOperations

DataWriteOperations

MetadataOperations

FreeStorageCapacity

FreeDataStorageCapacity

LogicalDiskUsage

PhysicalDiskUsage

Metrics with datapoints requirements

For metrics without an obvious default alarm threshold, Application Insights waits until the metric has enough datapoints to predict a reasonable threshold. Before it creates an alarm, CloudWatch Application Insights checks that the metric meets the following datapoints requirements:

  • The metric has at least 100 datapoints in the period from 15 days ago to 2 days ago.

  • The metric has at least 100 datapoints from the last day.
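The two requirements above can be sketched as a simple check over a metric's datapoint timestamps. This is a minimal illustration, not the Application Insights implementation: the function name `meets_datapoint_requirements` is hypothetical, and the sketch assumes that both conditions must hold and that the windows are measured back from the current time.

```python
from datetime import datetime, timedelta, timezone

MIN_DATAPOINTS = 100  # threshold stated in the list above


def meets_datapoint_requirements(timestamps, now=None):
    """Return True if both datapoint conditions hold (assumed to be combined with AND)."""
    now = now or datetime.now(timezone.utc)
    # Window 1: from 15 days ago up to 2 days ago.
    older_start = now - timedelta(days=15)
    older_end = now - timedelta(days=2)
    # Window 2: the last day.
    recent_start = now - timedelta(days=1)

    older = sum(1 for t in timestamps if older_start <= t < older_end)
    recent = sum(1 for t in timestamps if recent_start <= t <= now)
    return older >= MIN_DATAPOINTS and recent >= MIN_DATAPOINTS
```

Passing `now` explicitly keeps the check deterministic, which is convenient when replaying historical datapoints.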

The following metrics follow these datapoints requirements. Note that it can take up to one hour for alarms to be created for CloudWatch agent metrics.

AWS/ApplicationELB

ActiveConnectionCount

ConsumedLCUs

HTTPCode_ELB_4XX_Count

HTTPCode_Target_2XX_Count

HTTPCode_Target_3XX_Count

HTTPCode_Target_4XX_Count

HTTPCode_Target_5XX_Count

NewConnectionCount

ProcessedBytes

TargetResponseTime

UnHealthyHostCount

AWS/AutoScaling

GroupDesiredCapacity

GroupInServiceInstances

GroupMaxSize

GroupMinSize

GroupPendingInstances

GroupStandbyInstances

GroupTerminatingInstances

GroupTotalInstances

AWS/EC2

CPUCreditBalance

CPUCreditUsage

CPUSurplusCreditBalance

CPUSurplusCreditsCharged

CPUUtilization

DiskReadBytes

DiskReadOps

DiskWriteBytes

DiskWriteOps

EBSByteBalance%

EBSIOBalance%

EBSReadBytes

EBSReadOps

EBSWriteBytes

EBSWriteOps

NetworkIn

NetworkOut

NetworkPacketsIn

NetworkPacketsOut

Elastic Block Store (EBS)

VolumeReadBytes

VolumeWriteBytes

VolumeReadOps

VolumeWriteOps

VolumeTotalReadTime

VolumeTotalWriteTime

VolumeIdleTime

VolumeQueueLength

VolumeThroughputPercentage

VolumeConsumedReadWriteOps

BurstBalance

AWS/ELB

EstimatedALBActiveConnectionCount

EstimatedALBConsumedLCUs

EstimatedALBNewConnectionCount

EstimatedProcessedBytes

HTTPCode_Backend_4XX

HTTPCode_Backend_5XX

HealthyHostCount

Latency

RequestCount

SurgeQueueLength

UnHealthyHostCount

AWS/RDS

ActiveTransactions

AuroraBinlogReplicaLag

AuroraReplicaLag

BackupRetentionPeriodStorageUsed

BinLogDiskUsage

BlockedTransactions

CPUCreditBalance

CommitLatency

CommitThroughput

DDLLatency

DDLThroughput

DMLLatency

DMLThroughput

DatabaseConnections

Deadlocks

DeleteLatency

DeleteThroughput

DiskQueueDepth

EngineUptime

FreeLocalStorage

FreeStorageSpace

FreeableMemory

InsertLatency

InsertThroughput

LoginFailures

NetworkReceiveThroughput

NetworkThroughput

NetworkTransmitThroughput

Queries

ReadIOPS

ReadThroughput

SelectLatency

SelectThroughput

SnapshotStorageUsed

TotalBackupStorageBilled

UpdateLatency

UpdateThroughput

VolumeBytesUsed

VolumeReadIOPs

VolumeWriteIOPs

WriteIOPS

WriteThroughput

AWS/Lambda

Errors

DeadLetterErrors

Duration

Throttles

IteratorAge

ProvisionedConcurrencySpilloverInvocations

AWS/SQS

ApproximateAgeOfOldestMessage

ApproximateNumberOfMessagesDelayed

ApproximateNumberOfMessagesNotVisible

ApproximateNumberOfMessagesVisible

NumberOfEmptyReceives

NumberOfMessagesDeleted

NumberOfMessagesReceived

NumberOfMessagesSent

AWS/CWAgent

LogicalDisk % Free Space

Memory % Committed Bytes In Use

Memory Available Mbytes

Network Interface Bytes Total/sec

Paging File % Usage

PhysicalDisk % Disk Time

PhysicalDisk Avg. Disk sec/Read

PhysicalDisk Avg. Disk sec/Write

PhysicalDisk Disk Read Bytes/sec

PhysicalDisk Disk Reads/sec

PhysicalDisk Disk Write Bytes/sec

PhysicalDisk Disk Writes/sec

Processor % Idle Time

Processor % Interrupt Time

Processor % Processor Time

Processor % User Time

SQLServer:Access Methods Forwarded Records/sec

SQLServer:Access Methods Page Splits/sec

SQLServer:Buffer Manager Buffer cache hit ratio

SQLServer:Buffer Manager Page life expectancy

SQLServer:Database Replica File Bytes Received/sec

SQLServer:Database Replica Log Bytes Received/sec

SQLServer:Database Replica Log remaining for undo

SQLServer:Database Replica Log Send Queue

SQLServer:Database Replica Mirrored Write Transaction/sec

SQLServer:Database Replica Recovery Queue

SQLServer:Database Replica Redo Bytes Remaining

SQLServer:Database Replica Redone Bytes/sec

SQLServer:Database Replica Total Log requiring undo

SQLServer:Database Replica Transaction Delay

SQLServer:General Statistics Processes blocked

SQLServer:SQL Statistics Batch Requests/sec

SQLServer:SQL Statistics SQL Compilations/sec

SQLServer:SQL Statistics SQL Re-Compilations/sec

System Processor Queue Length

TCPv4 Connections Established

TCPv6 Connections Established

AWS/DynamoDB

ConsumedReadCapacityUnits

ConsumedWriteCapacityUnits

ReadThrottleEvents

WriteThrottleEvents

TimeToLiveDeletedItemCount

ConditionalCheckFailedRequests

TransactionConflict

ReturnedRecordsCount

PendingReplicationCount

ReplicationLatency

AWS/S3

ReplicationLatency

BytesPendingReplication

OperationsPendingReplication

4xxErrors

5xxErrors

AllRequests

GetRequests

PutRequests

DeleteRequests

HeadRequests

PostRequests

SelectRequests

ListRequests

SelectScannedBytes

SelectReturnedBytes

FirstByteLatency

TotalRequestLatency

BytesDownloaded

BytesUploaded

AWS/States

ActivitiesScheduled

ActivitiesStarted

ActivitiesSucceeded

ActivityScheduleTime

ActivityRunTime

ActivityTime

LambdaFunctionsScheduled

LambdaFunctionsStarted

LambdaFunctionsSucceeded

LambdaFunctionScheduleTime

LambdaFunctionRunTime

LambdaFunctionTime

ServiceIntegrationsScheduled

ServiceIntegrationsStarted

ServiceIntegrationsSucceeded

ServiceIntegrationScheduleTime

ServiceIntegrationRunTime

ServiceIntegrationTime

ProvisionedRefillRate

ProvisionedBucketSize

ConsumedCapacity

ThrottledEvents

AWS/ApiGateway

4XXError

IntegrationLatency

Latency

DataProcessed

CacheHitCount

CacheMissCount

The following table lists the recommended metrics for each component type.

Component type Workload type Recommended metric

EC2 instance (Windows servers)

Java Application

CPUUtilization

StatusCheckFailed

Processor % Processor Time

Memory % Committed Bytes In Use

Memory Available Mbytes

java_lang_threading_threadcount

java_lang_classloading_loadedclasscount

java_lang_memory_heapmemoryusage_used

java_lang_memory_heapmemoryusage_committed

java_lang_operatingsystem_freephysicalmemorysize

java_lang_operatingsystem_freeswapspacesize

Microsoft IIS/.NET Web Front-End

CPUUtilization

StatusCheckFailed

Processor % Processor Time

Memory % Committed Bytes In Use

Memory Available Mbytes

.NET CLR Exceptions # of Exceps Thrown/Sec

.NET CLR Memory # Total Committed Bytes

.NET CLR Memory % Time in GC

ASP.NET Applications Requests in Application Queue

ASP.NET Requests Queued

ASP.NET Application Restarts

Microsoft SQL Server Database Tier

CPUUtilization

StatusCheckFailed

Processor % Processor Time

Memory % Committed Bytes In Use

Memory Available Mbytes

Paging File % Usage

System Processor Queue Length

Network Interface Bytes Total/Sec

PhysicalDisk % Disk Time

SQLServer:Buffer Manager Buffer Cache Hit ratio

SQLServer:Buffer Manager Page Life Expectancy

SQLServer:General Statistics Processes Blocked

SQLServer:General Statistics User Connections

SQLServer:Locks Number of Deadlocks/Sec

SQLServer:SQL Statistics Batch Requests/Sec

.NET workerpool/Mid-Tier

CPUUtilization

StatusCheckFailed

Processor % Processor Time

Memory % Committed Bytes In Use

Memory Available Mbytes

.NET CLR Exceptions # of Exceps Thrown/Sec

.NET CLR Memory # Total Committed Bytes

.NET CLR Memory % Time in GC

.NET Core Tier

CPUUtilization

StatusCheckFailed

Processor % Processor Time

Memory % Committed Bytes In Use

Memory Available Mbytes

EC2 instance (Linux servers)

Java Application

CPUUtilization

StatusCheckFailed

disk_used_percent

mem_used_percent

java_lang_threading_threadcount

java_lang_classloading_loadedclasscount

java_lang_memory_heapmemoryusage_used

java_lang_memory_heapmemoryusage_committed

java_lang_operatingsystem_freephysicalmemorysize

java_lang_operatingsystem_freeswapspacesize

.NET Core Tier or SQL Server Database Tier

CPUUtilization

StatusCheckFailed

disk_used_percent

mem_used_percent

EBS volume

Any

VolumeReadBytes

VolumeWriteBytes

VolumeReadOps

VolumeWriteOps

VolumeQueueLength

VolumeThroughputPercentage

VolumeConsumedReadWriteOps

BurstBalance

Classic ELB

Any

HTTPCode_Backend_4XX

HTTPCode_Backend_5XX

Latency

SurgeQueueLength

UnHealthyHostCount

Application ELB

Any

HTTPCode_Target_4XX_Count

HTTPCode_Target_5XX_Count

TargetResponseTime

UnHealthyHostCount

RDS Database instance

Any

CPUUtilization

ReadLatency

WriteLatency

BurstBalance

FailedSQLServerAgentJobsCount

RDS Database cluster

Any

CPUUtilization

CommitLatency

DatabaseConnections

Deadlocks

FreeableMemory

NetworkThroughput

VolumeBytesUsed

Lambda Function

Any

Duration

Errors

IteratorAge

ProvisionedConcurrencySpilloverInvocations

Throttles

SQS Queue

Any

ApproximateAgeOfOldestMessage

ApproximateNumberOfMessagesVisible

NumberOfMessagesSent

Amazon DynamoDB table

Any

SystemErrors

UserErrors

ConsumedReadCapacityUnits

ConsumedWriteCapacityUnits

ReadThrottleEvents

WriteThrottleEvents

ConditionalCheckFailedRequests

TransactionConflict

Amazon S3 bucket

Any

If replication configuration with Replication Time Control (RTC) is enabled:

ReplicationLatency

BytesPendingReplication

OperationsPendingReplication

If request metrics are turned on:

5xxErrors

4xxErrors

BytesDownloaded

BytesUploaded

Amazon Step Functions

Any

General

  • ExecutionThrottled

  • ExecutionsAborted

  • ProvisionedBucketSize

  • ProvisionedRefillRate

  • ConsumedCapacity

If state machine type is EXPRESS or log group level is OFF

  • ExecutionsFailed

  • ExecutionsTimedOut

If state machine has Lambda functions

  • LambdaFunctionsFailed

  • LambdaFunctionsTimedOut

If state machine has activities

  • ActivitiesFailed

  • ActivitiesTimedOut

  • ActivitiesHeartbeatTimedOut

If state machine has service integrations

  • ServiceIntegrationsFailed

  • ServiceIntegrationsTimedOut

API Gateway REST API stage

Any

  • 4XXError

  • 5XXError

  • Latency

ECS Cluster

Any

CpuUtilized

MemoryUtilized

NetworkRxBytes

NetworkTxBytes

RunningTaskCount

PendingTaskCount

StorageReadBytes

StorageWriteBytes

CPUReservation (EC2 Launch Type only)

CPUUtilization (EC2 Launch Type only)

MemoryReservation (EC2 Launch Type only)

MemoryUtilization (EC2 Launch Type only)

GPUReservation (EC2 Launch Type only)

instance_cpu_utilization (EC2 Launch Type only)

instance_filesystem_utilization (EC2 Launch Type only)

instance_memory_utilization (EC2 Launch Type only)

instance_network_total_bytes (EC2 Launch Type only)

Java Application

CpuUtilized

MemoryUtilized

NetworkRxBytes

NetworkTxBytes

RunningTaskCount

PendingTaskCount

StorageReadBytes

StorageWriteBytes

CPUReservation (EC2 Launch Type only)

CPUUtilization (EC2 Launch Type only)

MemoryReservation (EC2 Launch Type only)

MemoryUtilization (EC2 Launch Type only)

GPUReservation (EC2 Launch Type only)

instance_cpu_utilization (EC2 Launch Type only)

instance_filesystem_utilization (EC2 Launch Type only)

instance_memory_utilization (EC2 Launch Type only)

instance_network_total_bytes (EC2 Launch Type only)

java_lang_threading_threadcount

java_lang_classloading_loadedclasscount

java_lang_memory_heapmemoryusage_used

java_lang_memory_heapmemoryusage_committed

java_lang_operatingsystem_freephysicalmemorysize

java_lang_operatingsystem_freeswapspacesize

ECS Service

Any

CPUUtilization

MemoryUtilization

CpuUtilized

MemoryUtilized

NetworkRxBytes

NetworkTxBytes

RunningTaskCount

PendingTaskCount

StorageReadBytes

StorageWriteBytes

Java Application

CPUUtilization

MemoryUtilization

CpuUtilized

MemoryUtilized

NetworkRxBytes

NetworkTxBytes

RunningTaskCount

PendingTaskCount

StorageReadBytes

StorageWriteBytes

java_lang_threading_threadcount

java_lang_classloading_loadedclasscount

java_lang_memory_heapmemoryusage_used

java_lang_memory_heapmemoryusage_committed

java_lang_operatingsystem_freephysicalmemorysize

java_lang_operatingsystem_freeswapspacesize

EKS Cluster

Any

cluster_failed_node_count

node_cpu_reserved_capacity

node_cpu_utilization

node_filesystem_utilization

node_memory_reserved_capacity

node_memory_utilization

node_network_total_bytes

pod_cpu_reserved_capacity

pod_cpu_utilization

pod_cpu_utilization_over_pod_limit

pod_memory_reserved_capacity

pod_memory_utilization

pod_memory_utilization_over_pod_limit

pod_network_rx_bytes

pod_network_tx_bytes

Java Application

cluster_failed_node_count

node_cpu_reserved_capacity

node_cpu_utilization

node_filesystem_utilization

node_memory_reserved_capacity

node_memory_utilization

node_network_total_bytes

pod_cpu_reserved_capacity

pod_cpu_utilization

pod_cpu_utilization_over_pod_limit

pod_memory_reserved_capacity

pod_memory_utilization

pod_memory_utilization_over_pod_limit

pod_network_rx_bytes

pod_network_tx_bytes

java_lang_threading_threadcount

java_lang_classloading_loadedclasscount

java_lang_memory_heapmemoryusage_used

java_lang_memory_heapmemoryusage_committed

java_lang_operatingsystem_freephysicalmemorysize

java_lang_operatingsystem_freeswapspacesize

Kubernetes Cluster on EC2

Any

cluster_failed_node_count

node_cpu_reserved_capacity

node_cpu_utilization

node_filesystem_utilization

node_memory_reserved_capacity

node_memory_utilization

node_network_total_bytes

pod_cpu_reserved_capacity

pod_cpu_utilization

pod_cpu_utilization_over_pod_limit

pod_memory_reserved_capacity

pod_memory_utilization

pod_memory_utilization_over_pod_limit

pod_network_rx_bytes

pod_network_tx_bytes

Java Application

cluster_failed_node_count

node_cpu_reserved_capacity

node_cpu_utilization

node_filesystem_utilization

node_memory_reserved_capacity

node_memory_utilization

node_network_total_bytes

pod_cpu_reserved_capacity

pod_cpu_utilization

pod_cpu_utilization_over_pod_limit

pod_memory_reserved_capacity

pod_memory_utilization

pod_memory_utilization_over_pod_limit

pod_network_rx_bytes

pod_network_tx_bytes

java_lang_threading_threadcount

java_lang_classloading_loadedclasscount

java_lang_memory_heapmemoryusage_used

java_lang_memory_heapmemoryusage_committed

java_lang_operatingsystem_freephysicalmemorysize

java_lang_operatingsystem_freeswapspacesize
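The Amazon Step Functions row in the table above is conditional: some metrics are recommended only for certain state machine configurations. The following sketch expresses those conditions as a selection function. It is an illustration only; the function name and parameter names are hypothetical and are not part of any Application Insights API. The metric names and conditions are taken from the table.

```python
def recommended_step_functions_metrics(
    machine_type="STANDARD",       # hypothetical parameter: "STANDARD" or "EXPRESS"
    log_level="ALL",               # hypothetical parameter: log group level, "OFF" disables logging
    has_lambda_functions=False,
    has_activities=False,
    has_service_integrations=False,
):
    """Return the recommended metric names for a state machine, per the table above."""
    # General metrics, recommended in every case.
    metrics = [
        "ExecutionThrottled",
        "ExecutionsAborted",
        "ProvisionedBucketSize",
        "ProvisionedRefillRate",
        "ConsumedCapacity",
    ]
    # Recommended if the state machine type is EXPRESS or the log group level is OFF.
    if machine_type == "EXPRESS" or log_level == "OFF":
        metrics += ["ExecutionsFailed", "ExecutionsTimedOut"]
    if has_lambda_functions:
        metrics += ["LambdaFunctionsFailed", "LambdaFunctionsTimedOut"]
    if has_activities:
        metrics += ["ActivitiesFailed", "ActivitiesTimedOut", "ActivitiesHeartbeatTimedOut"]
    if has_service_integrations:
        metrics += ["ServiceIntegrationsFailed", "ServiceIntegrationsTimedOut"]
    return metrics
```

For example, calling the function with `machine_type="EXPRESS"` and `has_lambda_functions=True` returns the five general metrics followed by the execution failure and Lambda failure metrics.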

Performance Counter metrics

Performance Counter metrics are recommended for instances only when the corresponding Performance Counter sets are installed on the Windows instances.

Performance Counter metric name Performance Counter set name
.NET CLR Exceptions # of Exceps Thrown .NET CLR Exceptions
.NET CLR Exceptions # of Exceps Thrown/Sec .NET CLR Exceptions
.NET CLR Exceptions # of Filters/Sec .NET CLR Exceptions
.NET CLR Exceptions # of Finallys/Sec .NET CLR Exceptions
.NET CLR Exceptions Throw to Catch Depth/Sec .NET CLR Exceptions
.NET CLR Interop # of CCWs .NET CLR Interop
.NET CLR Interop # of Stubs .NET CLR Interop
.NET CLR Interop # of TLB exports/Sec .NET CLR Interop
.NET CLR Interop # of TLB imports/Sec .NET CLR Interop
.NET CLR Interop # of Marshaling .NET CLR Interop
.NET CLR Jit % Time in Jit .NET CLR Jit
.NET CLR Jit Standard Jit Failures .NET CLR Jit
.NET CLR Loading % Time Loading .NET CLR Loading
.NET CLR Loading Rate of Load Failures .NET CLR Loading
.NET CLR LocksAndThreads Contention Rate/Sec .NET CLR LocksAndThreads
.NET CLR LocksAndThreads Queue Length/Sec .NET CLR LocksAndThreads
.NET CLR Memory # Total Committed Bytes .NET CLR Memory
.NET CLR Memory % Time in GC .NET CLR Memory
.NET CLR Networking 4.0.0.0 HttpWebRequest Average Queue Time .NET CLR Networking 4.0.0.0
.NET CLR Networking 4.0.0.0 HttpWebRequests Aborted/Sec .NET CLR Networking 4.0.0.0
.NET CLR Networking 4.0.0.0 HttpWebRequests Failed/Sec .NET CLR Networking 4.0.0.0
.NET CLR Networking 4.0.0.0 HttpWebRequests Queued/Sec .NET CLR Networking 4.0.0.0
APP_POOL_WAS Total Worker Process Ping Failures APP_POOL_WAS
ASP.NET Application Restarts ASP.NET
ASP.NET Applications % Managed Processor Time (estimated) ASP.NET Applications
ASP.NET Applications Errors Total/Sec ASP.NET Applications
ASP.NET Applications Errors Unhandled During Execution/Sec ASP.NET Applications
ASP.NET Applications Requests in Application Queue ASP.NET Applications
ASP.NET Applications Requests/Sec ASP.NET Applications
ASP.NET Request Wait Time ASP.NET
ASP.NET Requests Queued ASP.NET
HTTP Service Request Queues CurrentQueueSize HTTP Service Request Queues
LogicalDisk % Free Space LogicalDisk
Memory % Committed Bytes In Use Memory
Memory Available Mbytes Memory
Memory Pages/Sec Memory
Network Interface Bytes Total/Sec Network Interface
Paging File % Usage Paging File
PhysicalDisk % Disk Time PhysicalDisk
PhysicalDisk Avg. Disk Queue Length PhysicalDisk
PhysicalDisk Avg. Disk Sec/Read PhysicalDisk
PhysicalDisk Avg. Disk Sec/Write PhysicalDisk
PhysicalDisk Disk Read Bytes/Sec PhysicalDisk
PhysicalDisk Disk Reads/Sec PhysicalDisk
PhysicalDisk Disk Write Bytes/Sec PhysicalDisk
PhysicalDisk Disk Writes/Sec PhysicalDisk
Processor % Idle Time Processor
Processor % Interrupt Time Processor
Processor % Processor Time Processor
Processor % User Time Processor
SQLServer:Access Methods Forwarded Records/Sec SQLServer:Access Methods
SQLServer:Access Methods Full Scans/Sec SQLServer:Access Methods
SQLServer:Access Methods Page Splits/Sec SQLServer:Access Methods
SQLServer:Buffer Manager Buffer cache hit Ratio SQLServer:Buffer Manager
SQLServer:Buffer Manager Page life Expectancy SQLServer:Buffer Manager
SQLServer:Database Replica File Bytes Received/sec SQLServer:Database Replica
SQLServer:Database Replica Log Bytes Received/sec SQLServer:Database Replica
SQLServer:Database Replica Log remaining for undo SQLServer:Database Replica
SQLServer:Database Replica Log Send Queue SQLServer:Database Replica
SQLServer:Database Replica Mirrored Write Transaction/sec SQLServer:Database Replica
SQLServer:Database Replica Recovery Queue SQLServer:Database Replica
SQLServer:Database Replica Redo Bytes Remaining SQLServer:Database Replica
SQLServer:Database Replica Redone Bytes/sec SQLServer:Database Replica
SQLServer:Database Replica Total Log requiring undo SQLServer:Database Replica
SQLServer:Database Replica Transaction Delay SQLServer:Database Replica
SQLServer:General Statistics Processes Blocked SQLServer:General Statistics
SQLServer:General Statistics User Connections SQLServer:General Statistics
SQLServer:Latches Average Latch Wait Time (ms) SQLServer:Latches
SQLServer:Locks Average Wait Time (ms) SQLServer:Locks
SQLServer:Locks Lock Timeouts/Sec SQLServer:Locks
SQLServer:Locks Lock Waits/Sec SQLServer:Locks
SQLServer:Locks Number of Deadlocks/Sec SQLServer:Locks
SQLServer:Memory Manager Memory Grants Pending SQLServer:Memory Manager
SQLServer:SQL Statistics Batch Requests/Sec SQLServer:SQL Statistics
SQLServer:SQL Statistics SQL Compilations/Sec SQLServer:SQL Statistics
SQLServer:SQL Statistics SQL Re-Compilations/Sec SQLServer:SQL Statistics
System Processor Queue Length System
TCPv4 Connections Established TCPv4
TCPv6 Connections Established TCPv6
W3SVC_W3WP File Cache Flushes W3SVC_W3WP
W3SVC_W3WP File Cache Misses W3SVC_W3WP
W3SVC_W3WP Requests/Sec W3SVC_W3WP
W3SVC_W3WP URI Cache Flushes W3SVC_W3WP
W3SVC_W3WP URI Cache Misses W3SVC_W3WP
Web Service Bytes Received/Sec Web Service
Web Service Bytes Sent/Sec Web Service
Web Service Connection Attempts/Sec Web Service
Web Service Current Connections Web Service
Web Service Get Requests/Sec Web Service
Web Service Post Requests/Sec Web Service
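
The rule stated at the start of this section (a Performance Counter metric is recommended only when its counter set is installed) amounts to a lookup against the table above. The following sketch shows that lookup under stated assumptions: `COUNTER_SET_BY_METRIC` is a hypothetical excerpt of four rows from the table, and `recommendable_metrics` is a hypothetical helper. How the list of installed counter sets is obtained from a Windows instance is outside the scope of this sketch.

```python
# Hypothetical excerpt of the table above: metric name -> Performance Counter set name.
COUNTER_SET_BY_METRIC = {
    ".NET CLR Memory % Time in GC": ".NET CLR Memory",
    "ASP.NET Requests Queued": "ASP.NET",
    "Processor % Processor Time": "Processor",
    "SQLServer:Locks Number of Deadlocks/Sec": "SQLServer:Locks",
}


def recommendable_metrics(installed_counter_sets):
    """Return the metrics whose Performance Counter set is installed on the instance."""
    installed = set(installed_counter_sets)
    return [
        metric
        for metric, counter_set in COUNTER_SET_BY_METRIC.items()
        if counter_set in installed
    ]
```

With only the `Processor` and `ASP.NET` counter sets installed, the helper returns the ASP.NET and Processor metrics and omits the .NET CLR and SQL Server ones.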