interface InferenceComponentSpecificationProperty
Language | Type name |
---|---|
.NET | Amazon.CDK.AWS.Sagemaker.CfnInferenceComponent.InferenceComponentSpecificationProperty |
Go | github.com/aws/aws-cdk-go/awscdk/v2/awssagemaker#CfnInferenceComponent_InferenceComponentSpecificationProperty |
Java | software.amazon.awscdk.services.sagemaker.CfnInferenceComponent.InferenceComponentSpecificationProperty |
Python | aws_cdk.aws_sagemaker.CfnInferenceComponent.InferenceComponentSpecificationProperty |
TypeScript | aws-cdk-lib » aws_sagemaker » CfnInferenceComponent » InferenceComponentSpecificationProperty |
Details about the resources to deploy with this inference component, including the model, container, and compute resources.
Example
// The code below shows an example of how to instantiate this type.
// The values are placeholders you should change.
import { aws_sagemaker as sagemaker } from 'aws-cdk-lib';
const inferenceComponentSpecificationProperty: sagemaker.CfnInferenceComponent.InferenceComponentSpecificationProperty = {
  computeResourceRequirements: {
    maxMemoryRequiredInMb: 123,
    minMemoryRequiredInMb: 123,
    numberOfAcceleratorDevicesRequired: 123,
    numberOfCpuCoresRequired: 123,
  },

  // the properties below are optional
  container: {
    artifactUrl: 'artifactUrl',
    deployedImage: {
      resolutionTime: 'resolutionTime',
      resolvedImage: 'resolvedImage',
      specifiedImage: 'specifiedImage',
    },
    environment: {
      environmentKey: 'environment',
    },
    image: 'image',
  },
  modelName: 'modelName',
  startupParameters: {
    containerStartupHealthCheckTimeoutInSeconds: 123,
    modelDataDownloadTimeoutInSeconds: 123,
  },
};
Properties
Name | Type | Description |
---|---|---|
computeResourceRequirements | IResolvable \| Inference | The compute resources allocated to run the model assigned to the inference component. |
container? | IResolvable \| Inference | Defines a container that provides the runtime environment for a model that you deploy with an inference component. |
modelName? | string | The name of an existing SageMaker model object in your account that you want to deploy with the inference component. |
startupParameters? | IResolvable \| Inference | Settings that take effect while the model container starts up. |
computeResourceRequirements
Type: IResolvable | Inference
The compute resources allocated to run the model assigned to the inference component.
container?
Type: IResolvable | Inference (optional)
Defines a container that provides the runtime environment for a model that you deploy with an inference component.
modelName?
Type: string (optional)
The name of an existing SageMaker model object in your account that you want to deploy with the inference component.
startupParameters?
Type: IResolvable | Inference (optional)
Settings that take effect while the model container starts up.
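The placeholder values in the example (123 for every number) would not make a sensible specification. The following is a minimal sketch of a pre-synthesis sanity check on the memory fields of computeResourceRequirements. Everything in it is an assumption made for illustration: the local interfaces ComputeResourceRequirementsLike and SpecificationLike are hypothetical stand-ins that copy field names from the example so the code runs without aws-cdk-lib, the helper findSpecificationProblems is not part of the CDK, and the rule that minMemoryRequiredInMb should not exceed maxMemoryRequiredInMb is a plausible check rather than documented service validation.

```typescript
// Hypothetical local types: field names are copied from the example on this
// page; optionality of the compute fields is an assumption.
interface ComputeResourceRequirementsLike {
  maxMemoryRequiredInMb?: number;
  minMemoryRequiredInMb?: number;
  numberOfAcceleratorDevicesRequired?: number;
  numberOfCpuCoresRequired?: number;
}

interface SpecificationLike {
  computeResourceRequirements: ComputeResourceRequirementsLike;
  modelName?: string;
}

// Hypothetical helper (not a CDK API): returns human-readable problems found
// in the memory bounds. An empty array means no problem was detected.
function findSpecificationProblems(spec: SpecificationLike): string[] {
  const problems: string[] = [];
  const min = spec.computeResourceRequirements.minMemoryRequiredInMb;
  const max = spec.computeResourceRequirements.maxMemoryRequiredInMb;

  // Assumed rule: when both bounds are given, min must not exceed max.
  if (min !== undefined && max !== undefined && min > max) {
    problems.push(
      `minMemoryRequiredInMb (${min}) exceeds maxMemoryRequiredInMb (${max})`
    );
  }
  return problems;
}

// Usage: check a candidate specification before passing it to a construct.
const candidate: SpecificationLike = {
  computeResourceRequirements: {
    minMemoryRequiredInMb: 1024,
    maxMemoryRequiredInMb: 2048,
  },
  modelName: 'my-model', // placeholder name
};
const candidateProblems = findSpecificationProblems(candidate);
```

Because the local types are structural, an object that passes this check can still be assigned to the real InferenceComponentSpecificationProperty type as long as its fields match that interface.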