AWS::SageMaker::InferenceComponent InferenceComponentCapacitySize - Amazon CloudFormation
Services or capabilities described in Amazon Web Services documentation might vary by Region. To see the differences applicable to the China Regions, see Getting Started with Amazon Web Services in China (PDF).

This is the new Amazon CloudFormation Template Reference Guide. Please update your bookmarks and links. For help getting started with CloudFormation, see the Amazon CloudFormation User Guide.

AWS::SageMaker::InferenceComponent InferenceComponentCapacitySize

Specifies the type and size of the endpoint capacity to activate for a rolling deployment or a rollback strategy. You can specify your batches as either of the following:

  • A count of inference component copies

  • The overall percentage or your fleet

For a rollback strategy, if you don't specify the fields in this object, or if you set the Value parameter to 100%, then SageMaker AI uses a blue/green rollback strategy and rolls all traffic back to the blue fleet.

Syntax

To declare this entity in your Amazon CloudFormation template, use the following syntax:

JSON

{ "Type" : String, "Value" : Integer }

YAML

Type: String Value: Integer

Properties

Type

Specifies the endpoint capacity type.

COPY_COUNT

The endpoint activates based on the number of inference component copies.

CAPACITY_PERCENT

The endpoint activates based on the specified percentage of capacity.

Required: Yes

Type: String

Allowed values: COPY_COUNT | CAPACITY_PERCENT

Update requires: No interruption

Value

Defines the capacity size, either as a number of inference component copies or a capacity percentage.

Required: Yes

Type: Integer

Minimum: 1

Update requires: No interruption