AWS::SageMaker::InferenceComponent InferenceComponentComputeResourceRequirements - Amazon CloudFormation
Services or capabilities described in Amazon Web Services documentation might vary by Region. To see the differences applicable to the China Regions, see Getting Started with Amazon Web Services in China (PDF).

AWS::SageMaker::InferenceComponent InferenceComponentComputeResourceRequirements

Defines the compute resources to allocate to run a model, plus any adapter models, that you assign to an inference component. These resources include CPU cores, accelerators, and memory.

Syntax

To declare this entity in your Amazon CloudFormation template, use the following syntax:

Properties

MaxMemoryRequiredInMb

The maximum MB of memory to allocate to run a model that you assign to an inference component.

Required: No

Type: Integer

Minimum: 128

Update requires: No interruption

MinMemoryRequiredInMb

The minimum MB of memory to allocate to run a model that you assign to an inference component.

Required: No

Type: Integer

Minimum: 128

Update requires: No interruption

NumberOfAcceleratorDevicesRequired

The number of accelerators to allocate to run a model that you assign to an inference component. Accelerators include GPUs and Amazon Inferentia.

Required: No

Type: Number

Minimum: 1

Update requires: No interruption

NumberOfCpuCoresRequired

The number of CPU cores to allocate to run a model that you assign to an inference component.

Required: No

Type: Number

Minimum: 0

Update requires: No interruption