EI0013 Execution_Error_ROCE_CQE
Symptom
An error CQE occurred during operator execution. Local information: server %s, device ID %s, device IP %s. Peer information: server %s, device ID %s, device IP %s.
Possible Cause
- The network between two devices is abnormal. For example, the network port is intermittently disconnected.
- The peer process exits abnormally in advance. As a result, the local end cannot receive the response from the peer end.
Solution
- Check whether the network devices between the two ends are abnormal.
- Check whether the peer process exits first. If yes, check the cause of the process exit.
父主题: HCCL Errors