Elastic Fabric Adapter - Amazon ParallelCluster
Services or capabilities described in Amazon Web Services documentation might vary by Region. To see the differences applicable to the China Regions, see Getting Started with Amazon Web Services in China (PDF).

Elastic Fabric Adapter

Elastic Fabric Adapter (EFA) is a network device that has OS-bypass capabilities for low-latency network communications with other instances on the same subnet. EFA is exposed by using Libfabric, and can be used by applications using the Messaging Passing Interface (MPI).

To use EFA with Amazon ParallelCluster, add the line enable_efa = true to the [queue] section.

To view the list of EC2 instances that support EFA, see Supported instance types in the Amazon EC2 User Guide for Linux Instances.

For more information about the enable_efa setting, see enable_efa in the [queue] section.

A cluster placement group should be used to minimize latencies between instances. For more information, see placement and placement_group.

For more information, see Elastic Fabric Adapter in the Amazon EC2 User Guide and Scale HPC workloads with elastic fabric adapter and Amazon ParallelCluster in the Amazon Open Source Blog.

Note

By default, Ubuntu distributions enable ptrace (process trace) protection. Starting with Amazon ParallelCluster 2,6.0, ptrace protection is disabled so that Libfabric works properly. For more information, see Disable ptrace protection in the Amazon EC2 User Guide.

Note

Support EFA on Arm-based Graviton2 instances was added in Amazon ParallelCluster version 2.10.1.