Amazon Deep Learning Containers for PyTorch 2.6 ARM64 Inference on SageMaker - Amazon Deep Learning Containers
Services or capabilities described in Amazon Web Services documentation might vary by Region. To see the differences applicable to the China Regions, see Getting Started with Amazon Web Services in China (PDF).

Amazon Deep Learning Containers for PyTorch 2.6 ARM64 Inference on SageMaker

Amazon Deep Learning Containers (DLCs) for Amazon SageMaker are now available for ARM64 platforms, including Amazon Graviton instance types with support for PyTorch 2.6. You can launch the new versions of the DLC on SageMaker.

This release includes a container image for inference on CPU, optimized for performance and scale on Amazon. This Docker image was tested on SageMaker. It provides an optimized user experience for running deep learning workloads on SageMaker. All software components in this image are scanned for security vulnerabilities and updated or patched in accordance with Amazon Security best practices.

A list of available containers can be found in our documentation. Please refer to the SageMaker Graviton blog and DLC developer guide to migrate the Deep Learning workloads to Graviton instances. You can also subscribe to our discussion forum to get launch announcements and post your questions.

Release Notes

  • Introduced container for PyTorch 2.6.0 for inference supporting SageMaker services on ARM64 instances. For details about this release, check out our GitHub release tag.

  • Starting with PyTorch 2.6, we are removing Conda from the DLCs and installing all Python packages from PyPI.

  • TorchServe version: 0.12.0

  • Please refer to the official PyTorch 2.6.0 release notes here for the full description of framework updates.

Performance Improvements

These DLCs continue to deliver the best performance on Graviton for BERT and RoBERTa sentiment analysis and fill mask models, making Graviton3 the most cost effective CPU platform on the Amazon cloud for these models. For more information, please refer to the Graviton PyTorch User Guide.

Security Advisory

Amazon recommends that customers monitor critical security updates in the Amazon Security Bulletin.

Python 3.12 Support

Python 3.12 is supported in the PyTorch ARM64 Inference containers.

CPU Instance Type Support

The containers support Graviton CPU instance types supported under SageMaker.

Amazon Regions support

The containers are available in the following regions:

Region

Code

US East (Ohio)

us-east-2

US East (N. Virginia)

us-east-1

US West (Oregon)

us-west-2

US West (N. California)

us-west-1

AF South (Cape Town)

af-south-1

Asia Pacific (Hong Kong)

ap-east-1

Asia Pacific (Hyderabad)

ap-south-2

Asia Pacific (Mumbai)

ap-south-1

Asia Pacific (Osaka)

ap-northeast-3

Asia Pacific (Seoul)

ap-northeast-2

Asia Pacific (Tokyo)

ap-northeast-1

Asia Pacific (Melbourne)

ap-southeast-4

Asia Pacific (Jakarta)

ap-southeast-3

Asia Pacific (Sydney)

ap-southeast-2

Asia Pacific (Singapore)

ap-southeast-1

Asia Pacific (Malaysia)

ap-southeast-5

Central (Canada)

ca-central-1

Canada (Calgary)

ca-west-1

EU (Zurich)

eu-central-2

EU (Frankfurt)

eu-central-1

EU (Ireland)

eu-west-1

EU (London)

eu-west-2

EU( Paris)

eu-west-3

EU (Spain)

eu-south-2

EU (Milan)

eu-south-1

EU (Stockholm)

eu-north-1

Israel (Tel Aviv)

il-central-1

Middle East (Bahrain)

me-south-1

Middle East (UAE)

me-central-1

SA (Sau Paulo)

sa-east-1

China (Beijing)

cn-north-1

China (Ningxia)

cn-northwest-1

Build and Test

  • Built on: c6g.2xlarge

  • Tested on: c8g.4xlarge, t4g.2xlarge, r8g.2xlarge, m7g.4xlarge, g5g.4xlarge

Known Issues

  • None

For latest updates, please refer to the aws/deep-learning-containers GitHub repo.