TrafficRoutingConfig - Amazon SageMaker

TrafficRoutingConfig

Defines the traffic routing strategy used to shift traffic from the old fleet to the new fleet during an endpoint deployment.

Contents

Type

Traffic routing strategy type.

  • ALL_AT_ONCE: Endpoint traffic shifts to the new fleet in a single step.

  • CANARY: Endpoint traffic shifts to the new fleet in two steps. The first step is the canary, which is a small portion of the traffic. The second step is the remainder of the traffic.

  • LINEAR: Endpoint traffic shifts to the new fleet in n steps of a configurable size.

Type: String

Valid Values: ALL_AT_ONCE | CANARY | LINEAR

Required: Yes
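
The three strategies can be pictured as schedules of cumulative traffic on the new fleet. The Python sketch below is an illustrative model only, not SageMaker's implementation. The function name traffic_steps and the parameter step_percent are hypothetical, and the sketch assumes that step sizes are given as percentages and that the final LINEAR step simply tops up to 100%.

```python
from typing import List, Optional

# Valid values for the Type field, as listed above.
VALID_TYPES = ("ALL_AT_ONCE", "CANARY", "LINEAR")


def traffic_steps(routing_type: str, step_percent: Optional[int] = None) -> List[int]:
    """Return the cumulative percentage of traffic on the new fleet after each step.

    Hypothetical helper: step_percent stands in for the canary size (CANARY)
    or the linear step size (LINEAR), expressed as a percentage.
    """
    if routing_type not in VALID_TYPES:
        raise ValueError(f"unknown routing type: {routing_type!r}")
    if routing_type == "ALL_AT_ONCE":
        # A single step moves all traffic.
        return [100]
    if step_percent is None or not 0 < step_percent < 100:
        raise ValueError("step_percent must be between 1 and 99")
    if routing_type == "CANARY":
        # Two steps: the canary portion, then the remainder.
        return [step_percent, 100]
    # LINEAR: n equal steps; the last step is assumed to top up to 100%.
    steps = list(range(step_percent, 100, step_percent))
    steps.append(100)
    return steps


print(traffic_steps("ALL_AT_ONCE"))  # [100]
print(traffic_steps("CANARY", 10))   # [10, 100]
print(traffic_steps("LINEAR", 25))   # [25, 50, 75, 100]
```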

WaitIntervalInSeconds

The time to wait, in seconds, between the incremental steps that turn on traffic on the new endpoint fleet.

Type: Integer

Valid Range: Minimum value of 0. Maximum value of 3600.

Required: Yes
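
As a rough planning aid, the wait interval can be range-checked and multiplied out to estimate how long a shift spends waiting. This is a sketch under a simple reading of the description above: one wait between each pair of consecutive steps, so n steps imply n - 1 waits. The function name total_wait_seconds is hypothetical, and the estimate ignores any other time the deployment takes.

```python
# Valid range for WaitIntervalInSeconds, as documented above.
MIN_WAIT_SECONDS = 0
MAX_WAIT_SECONDS = 3600


def total_wait_seconds(num_steps: int, wait_interval_in_seconds: int) -> int:
    """Estimate the total time spent waiting between traffic-shifting steps.

    Assumption: exactly one wait occurs between consecutive steps.
    """
    if not MIN_WAIT_SECONDS <= wait_interval_in_seconds <= MAX_WAIT_SECONDS:
        raise ValueError(
            f"WaitIntervalInSeconds must be in "
            f"[{MIN_WAIT_SECONDS}, {MAX_WAIT_SECONDS}]"
        )
    if num_steps < 1:
        raise ValueError("num_steps must be at least 1")
    return (num_steps - 1) * wait_interval_in_seconds


# A four-step LINEAR shift with a 300-second wait between steps.
print(total_wait_seconds(4, 300))  # 900
```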

CanarySize

Batch size for the first step to turn on traffic on the new endpoint fleet. Value must be less than or equal to 50% of the variant's total instance count.

Type: CapacitySize object

Required: No
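
A CANARY configuration might look like the dictionary below, paired with a pre-flight check of the 50% limit. This is a minimal sketch: the Type and Value fields of CapacitySize, and the values CAPACITY_PERCENT and INSTANCE_COUNT, come from the separate CapacitySize data type and are not described on this page, so treat them as assumptions here. The helper canary_size_is_valid is hypothetical and does not replace the service's own validation.

```python
# Example TrafficRoutingConfig for a canary deployment (illustrative values).
canary_config = {
    "Type": "CANARY",
    "WaitIntervalInSeconds": 300,
    # Assumed CapacitySize shape: a Type plus an integer Value.
    "CanarySize": {"Type": "CAPACITY_PERCENT", "Value": 10},
}


def canary_size_is_valid(capacity_size: dict, total_instances: int) -> bool:
    """Check that the canary is at most 50% of the variant's instance count."""
    size_type = capacity_size["Type"]
    value = capacity_size["Value"]
    if size_type == "CAPACITY_PERCENT":
        return 0 < value <= 50
    if size_type == "INSTANCE_COUNT":
        # value <= 50% of total, written with integers to avoid rounding.
        return value > 0 and value * 2 <= total_instances
    raise ValueError(f"unknown CapacitySize type: {size_type!r}")


print(canary_size_is_valid(canary_config["CanarySize"], total_instances=10))
```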

LinearStepSize

Batch size for each step to turn on traffic on the new endpoint fleet. Value must be between 10% and 50% of the variant's total instance count.

Type: CapacitySize object

Required: No
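
The 10-50% bound on the linear step size can be checked the same way before a deployment is requested. As before, this is only a sketch: the CapacitySize fields (Type, Value) and the values CAPACITY_PERCENT and INSTANCE_COUNT are assumptions taken from the CapacitySize data type rather than from this page, and linear_step_is_valid is a hypothetical helper.

```python
# Example TrafficRoutingConfig for a linear deployment (illustrative values).
linear_config = {
    "Type": "LINEAR",
    "WaitIntervalInSeconds": 120,
    # Assumed CapacitySize shape: a Type plus an integer Value.
    "LinearStepSize": {"Type": "INSTANCE_COUNT", "Value": 2},
}


def linear_step_is_valid(capacity_size: dict, total_instances: int) -> bool:
    """Check that each step is between 10% and 50% of the instance count."""
    size_type = capacity_size["Type"]
    value = capacity_size["Value"]
    if size_type == "CAPACITY_PERCENT":
        return 10 <= value <= 50
    if size_type == "INSTANCE_COUNT":
        # 10% <= value / total <= 50%, written with integers to avoid rounding.
        return total_instances <= 10 * value and 2 * value <= total_instances
    raise ValueError(f"unknown CapacitySize type: {size_type!r}")


# 2 of 10 instances per step is 20%, inside the documented range.
print(linear_step_is_valid(linear_config["LinearStepSize"], total_instances=10))
```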
