

# Inference Container Images


SageMaker Neo now provides inference image URI information for `ml_*` targets. For more information see [DescribeCompilationJob](https://docs.amazonaws.cn/sagemaker/latest/APIReference/API_DescribeCompilationJob.html#sagemaker-DescribeCompilationJob-response-InferenceImage).

Based on your use case, replace the highlighted portion in the inference image URI template provided below with appropriate values. 

## Amazon SageMaker AI XGBoost


```
aws_account_id.dkr.ecr.aws_region.amazonaws.com/xgboost-neo:latest
```

Replace *aws\$1account\$1id* from the table at the end of this page based on the *aws\$1region* you used.

## Keras


```
aws_account_id.dkr.ecr.aws_region.amazonaws.com/sagemaker-neo-keras:fx_version-instance_type-py3
```

Replace *aws\$1account\$1id* from the table at the end of this page based on the *aws\$1region* you used.

Replace *fx\$1version* with `2.2.4`.

Replace *instance\$1type* with either `cpu` or `gpu`.

## MXNet


------
#### [ CPU or GPU instance types ]

```
aws_account_id.dkr.ecr.aws_region.amazonaws.com/sagemaker-inference-mxnet:fx_version-instance_type-py3
```

Replace *aws\$1account\$1id* from the table at the end of this page based on the *aws\$1region* you used. 

Replace *fx\$1version* with `1.8.0`. 

Replace *instance\$1type* with either `cpu` or `gpu`. 

------
#### [ Inferentia1 ]

```
aws_account_id.dkr.ecr.aws_region.amazonaws.com/sagemaker-neo-mxnet:fx_version-instance_type-py3
```

Replace *aws\$1region* with either `us-east-1` or `us-west-2`. 

Replace *aws\$1account\$1id* from the table at the end of this page based on the *aws\$1region* you used. 

Replace *fx\$1version* with `1.5.1`. 

Replace *`instance_type`* with `inf`.

------

## ONNX


```
aws_account_id.dkr.ecr.aws_region.amazonaws.com/sagemaker-neo-onnx:fx_version-instance_type-py3
```

Replace *aws\$1account\$1id* from the table at the end of this page based on the *aws\$1region* you used.

Replace *fx\$1version* with `1.5.0`.

Replace *instance\$1type* with either `cpu` or `gpu`.

## PyTorch


------
#### [ CPU or GPU instance types ]

```
aws_account_id.dkr.ecr.aws_region.amazonaws.com/sagemaker-inference-pytorch:fx_version-instance_type-py3
```

Replace *aws\$1account\$1id* from the table at the end of this page based on the *aws\$1region* you used. 

Replace *fx\$1version* with `1.4`, `1.5`, `1.6`, `1.7`, `1.8`, `1.12`, `1.13`, or `2.0`.

Replace *instance\$1type* with either `cpu` or `gpu`. 

------
#### [ Inferentia1 ]

```
aws_account_id.dkr.ecr.aws_region.amazonaws.com/sagemaker-neo-pytorch:fx_version-instance_type-py3
```

Replace *aws\$1region* with either `us-east-1` or `us-west-2`. 

Replace *aws\$1account\$1id* from the table at the end of this page based on the *aws\$1region* you used. 

Replace *fx\$1version* with `1.5.1`. 

Replace *`instance_type`* with `inf`.

------
#### [ Inferentia2 and Trainium1 ]

```
763104351884.dkr.ecr.aws_region.amazonaws.com/pytorch-inference-neuronx:1.13.1-neuronx-py38-sdk2.10.0-ubuntu20.04
```

Replace *aws\$1region* with `us-east-2` for Inferentia2, and `us-east-1` for Trainium1.

------

## TensorFlow


------
#### [ CPU or GPU instance types ]

```
aws_account_id.dkr.ecr.aws_region.amazonaws.com/sagemaker-inference-tensorflow:fx_version-instance_type-py3
```

Replace *aws\$1account\$1id* from the table at the end of this page based on the *aws\$1region* you used. 

Replace *fx\$1version* with `1.15.3` or `2.9`. 

Replace *instance\$1type* with either `cpu` or `gpu`. 

------
#### [ Inferentia1 ]

```
aws_account_id.dkr.ecr.aws_region.amazonaws.com/sagemaker-neo-tensorflow:fx_version-instance_type-py3
```

Replace *aws\$1account\$1id* from the table at the end of this page based on the *aws\$1region* you used. Note that for instance type `inf` only `us-east-1` and `us-west-2` are supported.

Replace *fx\$1version* with `1.15.0`

Replace *instance\$1type* with `inf`.

------
#### [ Inferentia2 and Trainium1 ]

```
763104351884.dkr.ecr.aws_region.amazonaws.com/tensorflow-inference-neuronx:2.10.1-neuronx-py38-sdk2.10.0-ubuntu20.04
```

Replace *aws\$1region* with `us-east-2` for Inferentia2, and `us-east-1` for Trainium1.

------

The following table maps *aws\$1account\$1id* with *aws\$1region*. Use this table to find the correct inference image URI you need for your application. 


| aws\$1account\$1id | aws\$1region | 
| --- | --- | 
| 785573368785 | us-east-1 | 
| 007439368137 | us-east-2 | 
| 710691900526 | us-west-1 | 
| 301217895009 | us-west-2 | 
| 802834080501 | eu-west-1 | 
| 205493899709 | eu-west-2 | 
| 254080097072 | eu-west-3 | 
| 601324751636 | eu-north-1 | 
| 966458181534 | eu-south-1 | 
| 746233611703 | eu-central-1 | 
| 110948597952 | ap-east-1 | 
| 763008648453 | ap-south-1 | 
| 941853720454 | ap-northeast-1 | 
| 151534178276 | ap-northeast-2 | 
| 925152966179 | ap-northeast-3 | 
| 324986816169 | ap-southeast-1 | 
| 355873309152 | ap-southeast-2 | 
| 474822919863 | cn-northwest-1 | 
| 472730292857 | cn-north-1 | 
| 756306329178 | sa-east-1 | 
| 464438896020 | ca-central-1 | 
| 836785723513 | me-south-1 | 
| 774647643957 | af-south-1 | 
| 275950707576 | il-central-1 | 