Inference Container Images - Amazon SageMaker AI
Services or capabilities described in Amazon Web Services documentation might vary by Region. To see the differences applicable to the China Regions, see Getting Started with Amazon Web Services in China (PDF).

Inference Container Images

SageMaker Neo now provides inference image URI information for ml_* targets. For more information see DescribeCompilationJob.

Based on your use case, replace the highlighted portion in the inference image URI template provided below with appropriate values.

aws_account_id.dkr.ecr.aws_region.amazonaws.com/xgboost-neo:latest

Replace aws_account_id from the table at the end of this page based on the aws_region you used.

aws_account_id.dkr.ecr.aws_region.amazonaws.com/sagemaker-neo-keras:fx_version-instance_type-py3

Replace aws_account_id from the table at the end of this page based on the aws_region you used.

Replace fx_version with 2.2.4.

Replace instance_type with either cpu or gpu.

CPU or GPU instance types
aws_account_id.dkr.ecr.aws_region.amazonaws.com/sagemaker-inference-mxnet:fx_version-instance_type-py3

Replace aws_account_id from the table at the end of this page based on the aws_region you used.

Replace fx_version with 1.8.0.

Replace instance_type with either cpu or gpu.

Inferentia1
aws_account_id.dkr.ecr.aws_region.amazonaws.com/sagemaker-neo-mxnet:fx_version-instance_type-py3

Replace aws_region with either us-east-1 or us-west-2.

Replace aws_account_id from the table at the end of this page based on the aws_region you used.

Replace fx_version with 1.5.1.

Replace instance_type with inf.

aws_account_id.dkr.ecr.aws_region.amazonaws.com/sagemaker-neo-onnx:fx_version-instance_type-py3

Replace aws_account_id from the table at the end of this page based on the aws_region you used.

Replace fx_version with 1.5.0.

Replace instance_type with either cpu or gpu.

CPU or GPU instance types
aws_account_id.dkr.ecr.aws_region.amazonaws.com/sagemaker-inference-pytorch:fx_version-instance_type-py3

Replace aws_account_id from the table at the end of this page based on the aws_region you used.

Replace fx_version with 1.4, 1.5, 1.6, 1.7, 1.8, 1.12, 1.13, or 2.0.

Replace instance_type with either cpu or gpu.

Inferentia1
aws_account_id.dkr.ecr.aws_region.amazonaws.com/sagemaker-neo-pytorch:fx_version-instance_type-py3

Replace aws_region with either us-east-1 or us-west-2.

Replace aws_account_id from the table at the end of this page based on the aws_region you used.

Replace fx_version with 1.5.1.

Replace instance_type with inf.

Inferentia2 and Trainium1
763104351884.dkr.ecr.aws_region.amazonaws.com/pytorch-inference-neuronx:1.13.1-neuronx-py38-sdk2.10.0-ubuntu20.04

Replace aws_region with us-east-2 for Inferentia2, and us-east-1 for Trainium1.

CPU or GPU instance types
aws_account_id.dkr.ecr.aws_region.amazonaws.com/sagemaker-inference-tensorflow:fx_version-instance_type-py3

Replace aws_account_id from the table at the end of this page based on the aws_region you used.

Replace fx_version with 1.15.3 or 2.9.

Replace instance_type with either cpu or gpu.

Inferentia1
aws_account_id.dkr.ecr.aws_region.amazonaws.com/sagemaker-neo-tensorflow:fx_version-instance_type-py3

Replace aws_account_id from the table at the end of this page based on the aws_region you used. Note that for instance type inf only us-east-1 and us-west-2 are supported.

Replace fx_version with 1.15.0

Replace instance_type with inf.

Inferentia2 and Trainium1
763104351884.dkr.ecr.aws_region.amazonaws.com/tensorflow-inference-neuronx:2.10.1-neuronx-py38-sdk2.10.0-ubuntu20.04

Replace aws_region with us-east-2 for Inferentia2, and us-east-1 for Trainium1.

The following table maps aws_account_id with aws_region. Use this table to find the correct inference image URI you need for your application.

aws_account_id aws_region
785573368785 us-east-1
007439368137 us-east-2
710691900526 us-west-1
301217895009 us-west-2
802834080501 eu-west-1
205493899709 eu-west-2
254080097072 eu-west-3
601324751636 eu-north-1
966458181534 eu-south-1
746233611703 eu-central-1
110948597952 ap-east-1
763008648453 ap-south-1
941853720454 ap-northeast-1
151534178276 ap-northeast-2
925152966179 ap-northeast-3
324986816169 ap-southeast-1
355873309152 ap-southeast-2
474822919863 cn-northwest-1
472730292857 cn-north-1
756306329178 sa-east-1
464438896020 ca-central-1
836785723513 me-south-1
774647643957 af-south-1
275950707576 il-central-1