DescribeInferenceComponent - Amazon SageMaker

DescribeInferenceComponent

Returns information about an inference component.

Request Syntax

{ "InferenceComponentName": "string" }

Request Parameters

For information about the parameters that are common to all actions, see Common Parameters.

The request accepts the following data in JSON format.

InferenceComponentName

The name of the inference component.

Type: String

Length Constraints: Maximum length of 63.

Pattern: ^[a-zA-Z0-9]([\-a-zA-Z0-9]*[a-zA-Z0-9])?$

Required: Yes

Response Syntax

{ "CreationTime": number, "EndpointArn": "string", "EndpointName": "string", "FailureReason": "string", "InferenceComponentArn": "string", "InferenceComponentName": "string", "InferenceComponentStatus": "string", "LastModifiedTime": number, "RuntimeConfig": { "CurrentCopyCount": number, "DesiredCopyCount": number }, "Specification": { "ComputeResourceRequirements": { "MaxMemoryRequiredInMb": number, "MinMemoryRequiredInMb": number, "NumberOfAcceleratorDevicesRequired": number, "NumberOfCpuCoresRequired": number }, "Container": { "ArtifactUrl": "string", "DeployedImage": { "ResolutionTime": number, "ResolvedImage": "string", "SpecifiedImage": "string" }, "Environment": { "string" : "string" } }, "ModelName": "string", "StartupParameters": { "ContainerStartupHealthCheckTimeoutInSeconds": number, "ModelDataDownloadTimeoutInSeconds": number } }, "VariantName": "string" }

Response Elements

If the action is successful, the service sends back an HTTP 200 response.

The following data is returned in JSON format by the service.

CreationTime

The time when the inference component was created.

Type: Timestamp

EndpointArn

The Amazon Resource Name (ARN) of the endpoint that hosts the inference component.

Type: String

Length Constraints: Minimum length of 20. Maximum length of 2048.

Pattern: arn:aws[a-z\-]*:sagemaker:[a-z0-9\-]*:[0-9]{12}:endpoint/.*

EndpointName

The name of the endpoint that hosts the inference component.

Type: String

Length Constraints: Maximum length of 63.

Pattern: ^[a-zA-Z0-9](-*[a-zA-Z0-9]){0,62}

FailureReason

If the inference component status is Failed, the reason for the failure.

Type: String

Length Constraints: Maximum length of 1024.

InferenceComponentArn

The Amazon Resource Name (ARN) of the inference component.

Type: String

Length Constraints: Minimum length of 20. Maximum length of 2048.

InferenceComponentName

The name of the inference component.

Type: String

Length Constraints: Maximum length of 63.

Pattern: ^[a-zA-Z0-9]([\-a-zA-Z0-9]*[a-zA-Z0-9])?$

InferenceComponentStatus

The status of the inference component.

Type: String

Valid Values: InService | Creating | Updating | Failed | Deleting

LastModifiedTime

The time when the inference component was last updated.

Type: Timestamp

RuntimeConfig

Details about the runtime settings for the model that is deployed with the inference component.

Type: InferenceComponentRuntimeConfigSummary object

Specification

Details about the resources that are deployed with this inference component.

Type: InferenceComponentSpecificationSummary object

VariantName

The name of the production variant that hosts the inference component.

Type: String

Length Constraints: Maximum length of 63.

Pattern: ^[a-zA-Z0-9](-*[a-zA-Z0-9]){0,62}

Errors

For information about the errors that are common to all actions, see Common Errors.

See Also

For more information about using this API in one of the language-specific AWS SDKs, see the following: