AWS::SageMaker::EndpointConfig AsyncInferenceClientConfig - Amazon CloudFormation

AWS::SageMaker::EndpointConfig AsyncInferenceClientConfig

Configures the behavior of the client used by SageMaker to interact with the model container during asynchronous inference.

Syntax

To declare this entity in your Amazon CloudFormation template, use the following syntax:
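The syntax block itself did not survive on this page. As a sketch reconstructed from the single property documented under Properties below, the YAML form would look roughly like this (the `Integer` placeholder stands for the property's type, not a literal value):

```yaml
# AsyncInferenceClientConfig, YAML form (reconstructed from the Properties section)
MaxConcurrentInvocationsPerInstance: Integer
```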

Properties

MaxConcurrentInvocationsPerInstance

The maximum number of concurrent requests that the SageMaker client sends to the model container on each instance. If you don't specify a value, SageMaker chooses an optimal value for you.

Required: No

Type: Integer

Update requires: Replacement
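
Example

The following is a minimal sketch of where this property type usually sits inside an `AWS::SageMaker::EndpointConfig` resource, assuming it is supplied through the `ClientConfig` property of `AsyncInferenceConfig`. The logical ID, model name, variant name, instance type, S3 path, and the value `4` are all hypothetical placeholders chosen for illustration, not recommended settings.

```yaml
Resources:
  AsyncEndpointConfig:                      # hypothetical logical ID
    Type: AWS::SageMaker::EndpointConfig
    Properties:
      ProductionVariants:
        - ModelName: my-model               # hypothetical; must refer to an existing SageMaker model
          VariantName: AllTraffic           # hypothetical variant name
          InitialInstanceCount: 1
          InitialVariantWeight: 1.0
          InstanceType: ml.m5.xlarge        # hypothetical instance type
      AsyncInferenceConfig:
        ClientConfig:
          # Cap on concurrent requests sent to the model container per instance.
          # Omit this property to let SageMaker choose a value.
          MaxConcurrentInvocationsPerInstance: 4
        OutputConfig:
          S3OutputPath: s3://amzn-s3-demo-bucket/async-results/   # hypothetical bucket and prefix
```

Because this property requires replacement on update, changing the value in an existing stack causes CloudFormation to create a new endpoint configuration rather than modify the existing one in place.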