ModelLatencyThreshold - Amazon SageMaker

ModelLatencyThreshold

The model latency threshold, expressed as a latency percentile and the latency value, in milliseconds, at that percentile.

Contents

Percentile

The model latency percentile threshold. Acceptable values are P95 and P99. For custom load tests, specify the value as P95.

Type: String

Length Constraints: Maximum length of 64.

Required: No

ValueInMilliseconds

The model latency value, in milliseconds, at the percentile specified by Percentile.

Type: Integer

Required: No
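
Example

The following is a minimal sketch of how the two fields described above could be assembled and checked on the client side before being sent in a request. The helper name make_model_latency_threshold and its validation logic are hypothetical and are not part of any Amazon SDK; only the field names, the P95 and P99 values, and the 64-character limit come from this page.

```python
# Hypothetical client-side helper; not part of any Amazon SDK.
ALLOWED_PERCENTILES = {"P95", "P99"}  # acceptable values listed on this page
MAX_PERCENTILE_LENGTH = 64            # length constraint listed on this page


def make_model_latency_threshold(percentile=None, value_in_milliseconds=None):
    """Build a ModelLatencyThreshold mapping. Both fields are optional."""
    threshold = {}

    if percentile is not None:
        if not isinstance(percentile, str):
            raise TypeError("Percentile must be a string")
        if len(percentile) > MAX_PERCENTILE_LENGTH:
            raise ValueError("Percentile must be at most 64 characters")
        if percentile not in ALLOWED_PERCENTILES:
            raise ValueError("Percentile must be P95 or P99")
        threshold["Percentile"] = percentile

    if value_in_milliseconds is not None:
        # bool is a subclass of int in Python, so reject it explicitly.
        if isinstance(value_in_milliseconds, bool) or not isinstance(
            value_in_milliseconds, int
        ):
            raise TypeError("ValueInMilliseconds must be an integer")
        threshold["ValueInMilliseconds"] = value_in_milliseconds

    return threshold


# A P95 threshold of 100 ms, as would suit a custom load test.
threshold = make_model_latency_threshold("P95", 100)
print(threshold)
```

Because neither field is required, calling the helper with no arguments returns an empty mapping. The service itself may apply additional validation that this sketch does not reproduce.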

See Also

For more information about using this API in one of the language-specific Amazon SDKs, see the following: