Monitoring Amazon FSx for Lustre file systems - FSx for Lustre
Services or capabilities described in Amazon Web Services documentation might vary by Region. To see the differences applicable to the China Regions, see Getting Started with Amazon Web Services in China (PDF).

Monitoring Amazon FSx for Lustre file systems

Monitoring is an important part of maintaining the reliability, availability, and performance of your FSx for Lustre file system and your other Amazon solutions. Collecting monitoring data from all parts of your Amazon solution allows you to more easily debug a multi-point failure if one occurs. You can monitor your FSx for Lustre file system, report when something is wrong, and take action automatically when appropriate using the following tools:

  • Amazon CloudWatch – Monitors your Amazon resources and the applications that you run on Amazon in real time. You can collect and track metrics, create customized dashboards, and set alarms that notify you when a specified metric reaches a threshold that you specify. For example, you can have CloudWatch track storage capacity or other metrics for your Amazon FSx for Lustre instances and automatically launch new instances when needed.

  • Lustre logging – Monitors the enabled logging events for your file system. Lustre logging writes these events to Amazon CloudWatch Logs.

  • Amazon CloudTrail – Captures API calls and related events made by or on behalf of your Amazon Web Services account and delivers the log files to an Amazon S3 bucket that you specify. You can identify which users and accounts called Amazon, the source IP address from which the calls were made, and when the calls occurred.

The following sections provide information on how to use the tools with your FSx for Lustre file systems.