What is Amazon Route 53 Application Recovery Controller? - Amazon Route 53 Application Recovery Controller
Services or capabilities described in Amazon Web Services documentation might vary by Region. To see the differences applicable to the China Regions, see Getting Started with Amazon Web Services in China (PDF).

What is Amazon Route 53 Application Recovery Controller?

Amazon Route 53 Application Recovery Controller (Route 53 ARC) helps you prepare for and complete faster recovery for applications running on Amazon. Route 53 ARC provides two sets of capabilities: Multi-Availability Zone (AZ) recovery, which includes zonal shift and zonal autoshift, and multi-Region recovery, which includes routing control and readiness check. With Route 53 ARC, you can leverage highly-available recovery tools to quickly mitigate impairments that are impacting your multi-Region or multi-AZ applications. You can also use readiness check to gain insights into whether your applications and resources are prepared for recovery.

The Amazon Global Cloud Infrastructure provides fault tolerance and resilience, with each Amazon Web Services Region made up of multiple, fully-isolated Availability Zones. Route 53 ARC works within this Amazon structure to help your applications be resilient.

For Route 53 ARC, only zonal shift and zonal autoshift are available in the Beijing and Ningxia Regions. Zonal shift helps you manage and coordinate recovery for Amazon applications by shifting load balancer traffic away when there’s an issue in an Availability Zone. With zonal autoshift, Amazon starts an autoshift when internal telemetry indicates that there is an Availability Zone impairment that could potentially impact customers. The internal telemetry incorporates metrics from several sources, including the Amazon network, and the Amazon EC2 and Elastic Load Balancing services.

Multi-AZ recovery

If you have applications that are built to take advantage of Availability Zones in Amazon, you can quickly isolate and recover from AZ impairments using zonal shift. Zonal shift enables you to recover from Availability Zone (AZ) impairments, by temporarily moving traffic for a supported resource away from an AZ, to healthy AZs in the Amazon Web Services Region. Starting a zonal shift helps your application recover quickly, for example, from a developer's bad code deployment or from an Amazon impairment in a single Availability Zone. By moving traffic away, you reduce the impact for clients who are using your application when there's an issue in one AZ.

You can start a zonal shift for any supported resource in your account in a Region. Amazon services automatically register supported Amazon resources with zonal shift in Route 53 ARC, so that you can start a zonal shift at any time.

Zonal autoshift is a capability in Route 53 ARC that you can enable to authorize Amazon to shift traffic away from an AZ for supported resources, on your behalf, to healthy AZs in the Amazon Web Services Region. Amazon starts an autoshift when internal telemetry indicates that there is an impairment in one AZ in a Region that could potentially impact customers. The internal telemetry incorporates metrics from multiple sources, including the Amazon network, and the Amazon EC2 and Elastic Load Balancing services.

Zonal shifts and autoshifts are temporary. When you start a manual zonal shift, you must specify an (extendable) expiration, of up to three days initially. If you want to continue to keep traffic away from an AZ, you can update the zonal shift and set a new expiration. With zonal autoshift, Amazon ends an autoshift when indicators show that there is no longer an issue or potential issue.

To learn more about these capabilities, see the following chapters:

Multi-Region recovery

If you have an application that you've designed to operate out of another Amazon Web Services Region to continue operations you can use routing control for failover. Routing control enables you to fail over traffic from one Amazon Web Services Region to another when there's an issue, so that you can ensure that your application stays available. Routing control includes safety rules, which help protect you from unintended outcomes, by imposing guardrails that you define. Using these rules, you can make sure, for example that only one of your application replicas, active or standby, is enabled and in use at a time.

For multi-Region recovery, Route 53 ARC can help you fail over DNS traffic across Amazon Web Services Regions. The extremely reliable routing controls in Route 53 ARC enable you to recover your application by rerouting traffic away from a Region with an impairment to a healthy Region.

With readiness check, Route 53 ARC continually monitors Amazon resource quotas, capacity, and network routing policies, and can notify you about changes that would affect your ability to fail over to a replica and recover. Continual readiness checks help make sure, on an ongoing basis, that you can maintain your multi-Region applications in a state that is scaled and configured to handle failover traffic. Readiness check is useful when you first configure Route 53 ARC, and during normal application operation. Readiness check is not intended to be used in the critical path for failover during an event.

To learn more about these capabilities, see the following chapters: