What is ARC?
Amazon Application Recovery Controller (ARC) helps you prepare for and complete faster recovery for applications running on the Amazon Global Cloud Infrastructure.
ARC provides the following capabilities:
Multi-Availability Zone (AZ) recovery, including zonal shift and zonal autoshift, which enable you to recover from single AZ impairments by temporarily shifting traffic from an impaired AZ to a healthy AZ.
Multi-Region recovery, which includes routing control for failover and readiness check for application monitoring.
Multi-Availability Zone recovery
Zonal shift
You can use ARC zonal shift to quickly isolate and recover from single Availability Zone (AZ) impairments. Zonal shift temporarily shifts traffic for a supported resource away from an impaired AZ to healthy AZs in the same Amazon Region. Starting a zonal shift helps your application recover quickly, for example, from a developer's bad code deployment or from an Amazon impairment in a single AZ. Shifting traffic away from the impaired AZ reduces the impact for clients who are using your application in the impaired AZ.
You can start a zonal shift for any supported resource in your account in an Amazon Region. Zonal shifts are manual and temporary. When you start a zonal shift, you must specify an (extendable) expiration of up to three days. To enable zonal shift for supported resources, refer to Supported resources.
Zonal autoshift
ARC zonal autoshift authorizes Amazon to shift traffic away from an impaired AZ for supported resources, on your behalf, to healthy AZs in the same Amazon Region. Amazon starts a zonal autoshift when internal telemetry indicates that there is an impairment in one AZ in an Amazon Region that could potentially impact customers. The internal telemetry incorporates metrics from multiple sources, including the Amazon network, and the Amazon EC2 and Elastic Load Balancing services.
Zonal autoshifts are temporary. Amazon ends a zonal autoshift when the internal telemetry indicators show that there is no longer an issue or potential issue.
To learn more about these capabilities, see the following chapters:
Multi-Region recovery
Routing control
ARC's extremely reliable routing controls enable multi-Region recovery so that your applications can failover Domain Name System DNS traffic across Amazon Regions.
If your application is designed to operate out of multiple Amazon Regions, you can use ARC routing control to failover between Regions. Routing control enables you to failover traffic from an impaired Amazon Region to a healthy Amazon Region, so that you can ensure that your application maintains availability. Routing control includes safety rules, which help protect you from unintended outcomes by imposing guardrails that you define. For example, you can impose a safety rule that only one of your application replicas, active or standby, is enables and in use.
Readiness check
ARC readiness check continually monitors Amazon resource quotas, capacity, and network routing policies, and can notify you about changes that may affect your ability to failover to a replica application and recover from Region impairment. Continual readiness checks ensure that you can maintain your multi-Region applications in a state that is scaled and configured to handle failover traffic. Readiness check is useful when you first configure ARC, and during normal application operation. Readiness check is not intended to be used in the critical path for failover during an event.
To learn more about these capabilities, see the following chapters: