Use the create-job-queue command to create a SageMaker Training job queue.
The following example creates a basic SageMaker Training job queue that uses a service environment:
aws batch create-job-queue \
--job-queue-name my-sm-training-fifo-jq \
--job-queue-type SAGEMAKER_TRAINING \
--priority 1 \
--service-environment-order order=1,serviceEnvironment=ExampleServiceEnvironment
Replace ExampleServiceEnvironment with the name of your service environment.
The command returns output similar to the following:
{
    "jobQueueName": "my-sm-training-fifo-jq",
    "jobQueueArn": "arn:aws:batch:region:account:job-queue/my-sm-training-fifo-jq"
}
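Later steps often need the queue ARN, so it can help to capture it when the queue is created. The following is a minimal sketch, not part of the original procedure: `create_training_queue` is a hypothetical wrapper name, and it assumes the global AWS CLI options `--query` and `--output text` to print only the `jobQueueArn` value.

```shell
# create_training_queue is a hypothetical helper, not an AWS CLI command.
# Arguments: $1 = job queue name, $2 = service environment name or ARN.
create_training_queue() {
  queue_name="$1"
  service_env="$2"
  # --query selects jobQueueArn from the JSON response;
  # --output text prints it without quotes.
  aws batch create-job-queue \
    --job-queue-name "$queue_name" \
    --job-queue-type SAGEMAKER_TRAINING \
    --priority 1 \
    --service-environment-order "order=1,serviceEnvironment=$service_env" \
    --query 'jobQueueArn' \
    --output text
}

# Example (creates a real resource when run against an AWS account):
# queue_arn=$(create_training_queue my-sm-training-fifo-jq ExampleServiceEnvironment)
```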
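After you create the job queue, verify that it was created successfully and is in a valid state.
Use the describe-job-queues command to view details about your job queue: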
aws batch describe-job-queues --job-queues my-sm-training-fifo-jq
The command returns output similar to the following:
{
    "jobQueues": [
        {
            "jobQueueName": "my-sm-training-fifo-jq",
            "jobQueueArn": "arn:aws:batch:region:account:job-queue/my-sm-training-fifo-jq",
            "state": "ENABLED",
            "status": "VALID",
            "statusReason": "JobQueue Healthy",
            "priority": 1,
            "computeEnvironmentOrder": [],
            "serviceEnvironmentOrder": [
                {
                    "order": 1,
                    "serviceEnvironment": "arn:aws:batch:region:account:service-environment/ExampleServiceEnvironment"
                }
            ],
            "jobQueueType": "SAGEMAKER_TRAINING",
            "tags": {},
            "jobStateTimeLimitActions": []
        }
    ]
}
Make sure that:
- The state is ENABLED
- The status is VALID
- The statusReason is JobQueue Healthy
- The jobQueueType is SAGEMAKER_TRAINING
- The serviceEnvironmentOrder references your service environment
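The checks above can also be scripted. The following is a minimal sketch under stated assumptions: `check_job_queue` is a hypothetical helper name, and it relies on the global AWS CLI options `--query` and `--output text` to print the three single-word fields on one tab-separated line. It does not check statusReason or serviceEnvironmentOrder, which still need a manual look at the full output.

```shell
# check_job_queue is a hypothetical helper, not an AWS CLI command.
# Argument: $1 = job queue name. Returns 0 only if the queue looks ready.
check_job_queue() {
  queue_name="$1"
  # Ask for just the three fields; text output separates them with tabs.
  fields=$(aws batch describe-job-queues \
    --job-queues "$queue_name" \
    --query 'jobQueues[0].[state,status,jobQueueType]' \
    --output text) || return 1

  # Split the whitespace-separated values into positional parameters.
  set -- $fields
  state="$1"
  status="$2"
  queue_type="$3"

  if [ "$state" = "ENABLED" ] && [ "$status" = "VALID" ] \
     && [ "$queue_type" = "SAGEMAKER_TRAINING" ]; then
    echo "OK: $queue_name is ready"
  else
    echo "NOT READY: state=$state status=$status jobQueueType=$queue_type" >&2
    return 1
  fi
}

# Example:
# check_job_queue my-sm-training-fifo-jq
```

A queue can report a transitional status for a short time after creation, so a failed check right after create-job-queue may only mean the queue is not ready yet.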