HCCL_LOGIC_SUPERPOD_ID

Description

For the SuperPoD networking of the Atlas A3 training products/Atlas A3 inference products, if the rank table file is not used to configure cluster resource information, you can use this environment variable to specify the ID of the supernode to which the process running on the current node belongs. In this way, a physical SuperPoD can be divided into multiple logical SuperPoDs.

This environment variable is set to a string of fewer than 128 characters. The default value is an empty string.

If it is not set, the value of Super Pod ID in the environment is used as the SuperPoD ID. You can run the npu-smi info -t spod-info -i id -c chip_id command to view the value of Super Pod ID.

Example

export HCCL_LOGIC_SUPERPOD_ID=super_pod_id_1

Restrictions

  • This environment variable applies only to the scenario where the rank table file is not used to configure cluster information in SuperPoD mode. If the rank table file is used, the configuration in the file is used preferentially.
  • This environment variable is used to divide a physical SuperPoD into multiple logical SuperPoDs. Ranks that belong to different physical SuperPoDs cannot be configured in the same logical SuperPoD.

Applicability

Atlas A3 training products/Atlas A3 inference products