BatchReplaceClusterNodesError - Amazon SageMaker
Services or capabilities described in Amazon Web Services documentation might vary by Region. To see the differences applicable to the China Regions, see Getting Started with Amazon Web Services in China (PDF).

BatchReplaceClusterNodesError

Represents an error encountered when replacing a node in a SageMaker HyperPod cluster.

Contents

ErrorCode

The error code associated with the error encountered when replacing a node.

Possible values:

  • InstanceIdNotFound: The instance does not exist in the specified cluster.

  • InvalidInstanceStatus: The instance is in a state that does not allow replacement. Wait for the instance to finish any ongoing changes before retrying.

  • InstanceIdInUse: Another operation is already in progress for this node. Wait for the operation to complete before retrying.

  • InternalServerError: An internal error occurred while processing this node.

Type: String

Valid Values: InstanceIdNotFound | InvalidInstanceStatus | InstanceIdInUse | InternalServerError

Required: Yes

Message

A human-readable message describing the error encountered when replacing a node.

Type: String

Required: Yes

NodeId

The EC2 instance ID of the node that encountered an error during the replacement operation.

Type: String

Length Constraints: Minimum length of 1. Maximum length of 256.

Pattern: i-[a-f0-9]{8}(?:[a-f0-9]{9})?

Required: Yes

See Also

For more information about using this API in one of the language-specific Amazon SDKs, see the following: