RollingUpdatePolicy - Amazon SageMaker
Services or capabilities described in Amazon Web Services documentation might vary by Region. To see the differences applicable to the China Regions, see Getting Started with Amazon Web Services in China (PDF).

RollingUpdatePolicy

Specifies a rolling deployment strategy for updating a SageMaker endpoint.

Contents

MaximumBatchSize

Batch size for each rolling step to provision capacity and turn on traffic on the new endpoint fleet, and terminate capacity on the old endpoint fleet. Value must be between 5% to 50% of the variant's total instance count.

Type: CapacitySize object

Required: Yes

WaitIntervalInSeconds

The length of the baking period, during which SageMaker monitors alarms for each batch on the new fleet.

Type: Integer

Valid Range: Minimum value of 0. Maximum value of 3600.

Required: Yes

MaximumExecutionTimeoutInSeconds

The time limit for the total deployment. Exceeding this limit causes a timeout.

Type: Integer

Valid Range: Minimum value of 600. Maximum value of 28800.

Required: No

RollbackMaximumBatchSize

Batch size for rollback to the old endpoint fleet. Each rolling step to provision capacity and turn on traffic on the old endpoint fleet, and terminate capacity on the new endpoint fleet. If this field is absent, the default value will be set to 100% of total capacity which means to bring up the whole capacity of the old fleet at once during rollback.

Type: CapacitySize object

Required: No

See Also

For more information about using this API in one of the language-specific Amazon SDKs, see the following: