Network Health Diagnosis

Function

Diagnoses the network health status and outputs the diagnosis result.

Table 1 Diagnostic items

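Item    | Time Required | Whether NPU Training or Inference Is Affected | Application Scenario
--------|---------------|-----------------------------------------------|---------------------
network | ≤ 10 seconds  | No                                            | Routine inspection or rollout of training or inference services, or when a network fault occurs.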

Preparations

Before running the network diagnosis, log in to the host as the root user, configure the IP address and subnet mask of the RoCE NIC, and then configure the IP address of the network detection object. Otherwise, the diagnosis may fail.
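The two preparation steps might look like the following sketch. It assumes the RoCE NIC is configured with hccn_tool, which this section does not name, and the device ID and all addresses are placeholders. The function only prints the commands so that they can be reviewed before being run as root.

```shell
# Print (rather than execute) the preparation commands for one NPU device.
# ASSUMPTION: hccn_tool is the configuration tool; this section does not name it.
# ASSUMPTION: the device ID and every address used below are placeholders.
print_roce_setup() {
    dev_id=$1; nic_ip=$2; netmask=$3; detect_ip=$4
    # Step 1: IP address and subnet mask of the RoCE NIC
    echo "hccn_tool -i $dev_id -ip -s address $nic_ip netmask $netmask"
    # Step 2: IP address of the network detection object
    echo "hccn_tool -i $dev_id -netdetect -s address $detect_ip"
}

# Example for device 0 with placeholder addresses
print_roce_setup 0 192.168.100.101 255.255.255.0 192.168.100.1
```

Repeat the two commands for each device whose network is to be diagnosed, replacing the placeholders with the values of your own network plan.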

Parameters

Table 2 lists only the parameter specific to this diagnosis. For details about the other, common parameters, see Common Parameters.

Table 2 Parameter description

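Parameter     | Description                                                                                 | Mandatory
--------------|---------------------------------------------------------------------------------------------|----------
[-i, --items] | Specifies the diagnosis check item. Set it to network to run the network health diagnosis. | Yes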

Example

ascend-dmi -dg -i network

[***@***]# ascend-dmi -dg -i network
Summary:
    Arch: aarch64
    Mode: ******
    Time: 20250529-19:22:43
 
Hardware:
    network:
        PASS

Fault Check Items

Table 3 Fault check items

Command Output | Description
---------------|------------
PASS           | The network is healthy.
SKIP           | The current product or scenario does not support this function.
INFO           | Informational message about the network.
WARN           | A warning (alarm) is reported for the network.
FAIL           | The network check failed.
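For scripted inspections, the result can be read from the command output. The sketch below assumes that the status keyword always appears on the first non-empty line after "network:", as in the PASS example above; the output layout for the other statuses is not shown in this section. The function names and the exit-code mapping are illustrative and not part of ascend-dmi.

```shell
# Read `ascend-dmi -dg -i network` output on stdin and print the status keyword.
# ASSUMPTION: the keyword is the first word on the first non-empty line
# after "network:", as in the PASS example.
extract_network_status() {
    awk '
        found && NF      { print $1; exit }
        $1 == "network:" { found = 1 }
    '
}

# Map a status keyword to an exit code (hypothetical mapping for illustration).
status_to_exit_code() {
    case "$1" in
        PASS|SKIP|INFO) return 0 ;;   # nothing to act on
        WARN)           return 1 ;;   # alarm reported
        *)              return 2 ;;   # FAIL or unrecognized output
    esac
}

# Usage (run on a host where ascend-dmi is installed):
#   status=$(ascend-dmi -dg -i network | extract_network_status)
#   status_to_exit_code "$status" || echo "network diagnosis: $status"
```

Treating unrecognized output as a failure keeps the script conservative if the output layout differs from the assumption.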