ProductionVariantServerlessUpdateConfig - Amazon SageMaker
Services or capabilities described in Amazon Web Services documentation might vary by Region. To see the differences applicable to the China Regions, see Getting Started with Amazon Web Services in China (PDF).

ProductionVariantServerlessUpdateConfig

Specifies the serverless update concurrency configuration for an endpoint variant.

Contents

MaxConcurrency

The updated maximum number of concurrent invocations your serverless endpoint can process.

Type: Integer

Valid Range: Minimum value of 1. Maximum value of 200.

Required: No

ProvisionedConcurrency

The updated amount of provisioned concurrency to allocate for the serverless endpoint. Should be less than or equal to MaxConcurrency.

Type: Integer

Valid Range: Minimum value of 1. Maximum value of 200.

Required: No

See Also

For more information about using this API in one of the language-specific Amazon SDKs, see the following: