On-Chip Memory Diagnosis
Function
Diagnose the on-chip memory and output the diagnosis result.
Item |
Time Required |
Whether NPU Training or Inference Is Affected |
Application Scenario |
|---|---|---|---|
On-chip memory diagnosis |
2–4 seconds |
No |
An on-chip memory ECC error occurs on the NPU during training or inference. |
- The on-chip memory stress test and on-chip memory diagnosis apply to different scenarios. For details, see Table 1. Perform the on-chip memory stress test or on-chip memory diagnosis as required.
- If you want to conduct the on-chip memory diagnosis, on-chip memory stress test, and on-chip memory high-risk address stress test at the same time, refer to One-Click On-Chip Memory Stress Test.
Parameters
Table 2 lists only a test-specific parameter. For details about other common parameters, see Common Parameters.
Parameter |
Description |
Mandatory |
|---|---|---|
[-i, --items] |
Specifies the diagnosis check item.
|
Yes |
Example
- hbm of the
Atlas A2 training product ascend-dmi -dg -i hbm
1 2 3 4 5 6 7 8 9
[***@***]# ascend-dmi -dg -i hbm Summary: Arch: aarch64 Mode: ****** Time: 20250529-19:25:25 Hardware: hbm: PASS
- chipMemory of the Atlas 300I Duo inference card
ascend-dmi -dg -i chipMemory
1 2 3 4 5 6 7 8 9
[***@***]# ascend-dmi -dg -i chipMemory Summary: Arch: aarch64 Mode: ****** Time: 20250529-19:25:25 Hardware: chipMemory: PASS
Fault Check Items
Command Output |
Description |
|---|---|
PASS |
The on-chip memory check is passed and no exception occurs. |
SKIP |
The product or scenario does not support on-chip memory detection. |
GENERAL_WARN |
There are historical isolation pages with multi-bit errors. 0x80E18401 is generated to warn NPU health management faults. If the number of these pages falls within the range of [16, 64), the normal operation does not affected. NOTE:
When the diagnostic item is chipMemory, no alarm of this severity is generated. |
EMERGENCY_WARN |
|
FAIL |
The on-chip memory check fails. Contact Huawei technical support or locate the fault by referring to the FAQs. |