

本文属于机器翻译版本。若本译文内容与英语原文存在差异，则一律以英文原文为准。

# P6 实例的软件要求
<a name="p6-support-dlami"></a>

以下是在 P6 实例上运行 DLAMI 的详细要求。

**Topics**
+ [P6-B200 要求](#dlami-support-p6-b200)
+ [p6e-00 要求 GB2](#dlami-support-p6e-gb200)
+ [P6-B300 要求](#dlami-support-p6-b300)
+ [GPU 功能测试](#dlami-p6-testing)

## P6-B200 要求
<a name="dlami-support-p6-b200"></a>

操作 P6-B200 实例需要以下软件：


|  软件  |  最低版本要求  | 
| --- | --- | 
|   **Nvidia CUDA Toolkit**   |  12.8  | 
|   **Nvidia Driver**   |  R570  | 
|   **NVLINK 5**   |  R570  | 
|   **Linux 内**   |  6.1  | 
|   **Elastic Fabric Adapter（EFA）**   |  1.41.0  | 
|   **Amazon OFI NCCL 插件**   |  1.15.0  | 

## p6e-00 要求 GB2
<a name="dlami-support-p6e-gb200"></a>

运行 p6e-GB2 00 实例需要以下软件：


|  软件  |  最低版本要求  | 
| --- | --- | 
|   **Nvidia CUDA Toolkit**   |  12.8  | 
|   **Nvidia Driver**   |  R570  | 
|   **Linux 内**   |  6.12  | 
|   **Elastic Fabric Adapter（EFA）**   |  1.42.0  | 
|   **Amazon OFI NCCL 插件**   |  1.15.0  | 

## P6-B300 要求
<a name="dlami-support-p6-b300"></a>

操作 P6-B300 实例需要以下软件：


|  软件  |  最低版本要求  | 
| --- | --- | 
|   **Nvidia CUDA Toolkit**   |  13.0  | 
|   **Nvidia Driver**   |  R580  | 
|   **NVLINK 5**   |  R580  | 
|   **Linux 内**   |  6.1  | 
|   **Elastic Fabric Adapter（EFA）**   |  1.44.0  | 
|   **Amazon OFI NCCL 插件**   |  1.17.1  | 

## 确认 GPU 功能
<a name="dlami-p6-testing"></a>

 **要确认功能，请执行 GPUs以下操作：**

1. 运行以下 Nvidia GPU 设备查询测试。

   ```
   $ /usr/local/cuda/extras/demo_suite/deviceQuery
   ```

1. 确认设备查询测试的输出。以下是 p6-b200 的输出示例。

   ```
   /usr/local/cuda/extras/demo_suite/deviceQuery Starting...
   
    CUDA Device Query (Runtime API)
   
   Detected 8 CUDA Capable device(s)
   ...
   deviceQuery, CUDA Driver = CUDART, CUDA Driver Version = 12.8, CUDA Runtime Version = 12.8, NumDevs = 8, Device0 = NVIDIA B200, Device1 = NVIDIA B200, Device2 = NVIDIA B200, Device3 = NVIDIA B200, Device4 = NVIDIA B200, Device5 = NVIDIA B200, Device6 = NVIDIA B200, Device7 = NVIDIA B200
   Result = PASS
   ```

 **确认 NVIDIA Driver 功能是否正常：**

1. 运行 Nvidia 系统管理接口。

   ```
   $ nvidia-smi
   ```

1. 确认系统管理界面的输出。以下是 p6-b200 的输出示例。

   ```
   +-----------------------------------------------------------------------------------------+
   | NVIDIA-SMI 570.133.20             Driver Version: 570.133.20     CUDA Version: 12.8     |
   |-----------------------------------------+------------------------+----------------------+
   | GPU  Name                 Persistence-M | Bus-Id          Disp.A | Volatile Uncorr. ECC |
   | Fan  Temp   Perf          Pwr:Usage/Cap |           Memory-Usage | GPU-Util  Compute M. |
   |                                         |                        |               MIG M. |
   |=========================================+========================+======================|
   |   0  NVIDIA B200                    Off |   00000000:51:00.0 Off |                    0 |
   | N/A   32C    P0            145W / 1000W |       0MiB / 183359MiB |      0%      Default |
   |                                         |                        |             Disabled |
   +-----------------------------------------+------------------------+----------------------+
   |   1  NVIDIA B200                    Off |   00000000:52:00.0 Off |                    0 |
   | N/A   30C    P0            140W / 1000W |       0MiB / 183359MiB |      0%      Default |
   |                                         |                        |             Disabled |
   +-----------------------------------------+------------------------+----------------------+
   |   2  NVIDIA B200                    Off |   00000000:62:00.0 Off |                    0 |
   | N/A   31C    P0            139W / 1000W |       0MiB / 183359MiB |      0%      Default |
   |                                         |                        |             Disabled |
   +-----------------------------------------+------------------------+----------------------+
   |   3  NVIDIA B200                    Off |   00000000:63:00.0 Off |                    0 |
   | N/A   29C    P0            139W / 1000W |       0MiB / 183359MiB |      0%      Default |
   |                                         |                        |             Disabled |
   +-----------------------------------------+------------------------+----------------------+
   |   4  NVIDIA B200                    Off |   00000000:75:00.0 Off |                    0 |
   | N/A   31C    P0            141W / 1000W |       0MiB / 183359MiB |      0%      Default |
   |                                         |                        |             Disabled |
   +-----------------------------------------+------------------------+----------------------+
   |   5  NVIDIA B200                    Off |   00000000:76:00.0 Off |                    0 |
   | N/A   31C    P0            141W / 1000W |       0MiB / 183359MiB |      0%      Default |
   |                                         |                        |             Disabled |
   +-----------------------------------------+------------------------+----------------------+
   |   6  NVIDIA B200                    Off |   00000000:86:00.0 Off |                    0 |
   | N/A   32C    P0            141W / 1000W |       0MiB / 183359MiB |      0%      Default |
   |                                         |                        |             Disabled |
   +-----------------------------------------+------------------------+----------------------+
   |   7  NVIDIA B200                    Off |   00000000:87:00.0 Off |                    0 |
   | N/A   30C    P0            138W / 1000W |       0MiB / 183359MiB |      0%      Default |
   |                                         |                        |             Disabled |
   +-----------------------------------------+------------------------+----------------------+
                                                                                            
   +-----------------------------------------------------------------------------------------+
   | Processes:                                                                              |
   |  GPU   GI   CI              PID   Type   Process name                        GPU Memory |
   |        ID   ID                                                               Usage      |
   |=========================================================================================|
   |  No running processes found                                                             |
   +-----------------------------------------------------------------------------------------+
   ```

**注意**  
如果您遇到任何问题，请联系 Supp Amazon ort。