What is Application Auto Scaling? - Application Auto Scaling
Services or capabilities described in Amazon Web Services documentation might vary by Region. To see the differences applicable to the China Regions, see Getting Started with Amazon Web Services in China (PDF).

What is Application Auto Scaling?

Application Auto Scaling is a web service for developers and system administrators who need a solution for automatically scaling their scalable resources for individual Amazon services beyond Amazon EC2.

In the China (Beijing) Region, you can configure automatic scaling for the following resources:

  • Aurora replicas

  • DynamoDB tables and global secondary indexes

  • Amazon ECS services

  • Amazon EMR clusters

  • Amazon Keyspaces (for Apache Cassandra) tables

  • Lambda function provisioned concurrency

  • Amazon Managed Streaming for Apache Kafka (MSK) broker storage

  • Amazon Neptune clusters

  • SageMaker endpoint variants

  • Spot Fleet requests

  • Custom resources provided by your own applications or services. For more information, see the GitHub repository.

In the China (Ningxia) Region, you can configure automatic scaling for the following resources:

  • Aurora replicas

  • DynamoDB tables and global secondary indexes

  • Amazon ECS services

  • Amazon EMR clusters

  • Amazon Keyspaces (for Apache Cassandra) tables

  • Lambda function provisioned concurrency

  • Amazon Managed Streaming for Apache Kafka (MSK) broker storage

  • Amazon Neptune clusters

  • SageMaker endpoint variants

  • Spot Fleet requests

To see the regional availability for any of the Amazon services listed above, see the Region table.

For information about scaling your fleet of Amazon EC2 instances using Auto Scaling groups, see the Amazon EC2 Auto Scaling User Guide.

Features of Application Auto Scaling

Application Auto Scaling allows you to automatically scale your scalable resources according to conditions that you define.

  • Target tracking scaling – Scale a resource based on a target value for a specific CloudWatch metric.

  • Step scaling – Scale a resource based on a set of scaling adjustments that vary based on the size of the alarm breach.

  • Scheduled scaling – Scale a resource one time only or on a recurring schedule.

Work with Application Auto Scaling

You can configure scaling using the following interfaces depending on the resource that you are scaling:

  • Amazon Web Services Management Console – Provides a web interface that you can use to configure scaling. If you've signed up for an Amazon account, access Application Auto Scaling by signing into the Amazon Web Services Management Console. Then, open the service console for one of the resources listed in the introduction. Ensure that you open the console in the same Amazon Web Services Region as the resource that you want to work with.

    Note

    Console access is not available for all resources. For more information, see Amazon services that you can use with Application Auto Scaling.

  • Amazon Command Line Interface (Amazon CLI) – Provides commands for a broad set of Amazon Web Services, and is supported on Windows, macOS, and Linux. To get started, see Set up the Amazon CLI. For more information, see application-autoscaling in the Amazon CLI Command Reference.

  • Amazon Tools for Windows PowerShell – Provides commands for a broad set of Amazon products for those who script in the PowerShell environment. To get started, see the Amazon Tools for Windows PowerShell User Guide. For more information, see the Amazon Tools for PowerShell Cmdlet Reference.

  • Amazon SDKs – Provides language-specific API operations and takes care of many of the connection details, such as calculating signatures, handling request retries, and handling errors. For more information, see Amazon SDKs.

  • HTTPS API – Provides low-level API actions that you call using HTTPS requests. For more information, see the Application Auto Scaling API Reference.

  • Amazon CloudFormation – Supports configuring scaling using a CloudFormation template. For more information, see Create Application Auto Scaling resources with Amazon CloudFormation.

To connect programmatically to an Amazon Web Service, you use an endpoint. For information about endpoints for calls to Application Auto Scaling, see Endpoints and ARNs for Amazon Web Services in China in the Getting Started with Amazon Web Services in China.