InferenceMetrics

The metrics measured for an existing endpoint that are compared in an Inference Recommender job.

Contents

MaxInvocations

The expected maximum number of requests per minute for the instance.

Type: Integer

Required: Yes

ModelLatency

The expected model latency at maximum invocations per minute for the instance.

Type: Integer

Required: Yes
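
The following is a minimal sketch, using the AWS SDK for Python (Boto3), of how these fields might be read from a DescribeInferenceRecommendationsJob response. The job name is a placeholder, and this assumes the job benchmarked an existing endpoint so that EndpointPerformances is populated.

import boto3

sagemaker = boto3.client("sagemaker")

# Describe a completed Inference Recommender job; "my-recommender-job" is a
# hypothetical job name used here for illustration only.
response = sagemaker.describe_inference_recommendations_job(
    JobName="my-recommender-job"
)

# Each EndpointPerformances entry carries an InferenceMetrics object under
# the "Metrics" key.
for performance in response.get("EndpointPerformances", []):
    metrics = performance["Metrics"]
    print("MaxInvocations:", metrics["MaxInvocations"])
    print("ModelLatency:", metrics["ModelLatency"])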

See Also

For more information about using this API in one of the language-specific Amazon SDKs, see the following: