InferenceComponentSpecification - Amazon SageMaker
Services or capabilities described in Amazon Web Services documentation might vary by Region. To see the differences applicable to the China Regions, see Getting Started with Amazon Web Services in China (PDF).

InferenceComponentSpecification

Details about the resources to deploy with this inference component, including the model, container, and compute resources.

Contents

ComputeResourceRequirements

The compute resources allocated to run the model assigned to the inference component.

Type: InferenceComponentComputeResourceRequirements object

Required: Yes

Container

Defines a container that provides the runtime environment for a model that you deploy with an inference component.

Type: InferenceComponentContainerSpecification object

Required: No

ModelName

The name of an existing SageMaker model object in your account that you want to deploy with the inference component.

Type: String

Length Constraints: Maximum length of 63.

Pattern: ^[a-zA-Z0-9]([\-a-zA-Z0-9]*[a-zA-Z0-9])?

Required: No

StartupParameters

Settings that take effect while the model container starts up.

Type: InferenceComponentStartupParameters object

Required: No

See Also

For more information about using this API in one of the language-specific Amazon SDKs, see the following: