After Volcano Is Manually Installed, the Pod Status Is CrashLoopBackOff

Symptom

When the Volcano is running, its pod status is CrashLoopBackOff.

Example:

View the logs of the pod corresponding to Volcano.
  • The error message "permission denied" is displayed.

  • Waiting for streamwatcher.go times out.

Possible Causes

During manual installation, the Volcano log permission is incorrect.

Solution

  • Reset the owner group and permission of the Volcano log directory.
chown -R hwMindX:hwMindX /var/log/mindx-dl/volcano-*
chmod 750 /var/log/mindx-dl/volcano-*
chmod 640 /var/log/mindx-dl/volcano-*/*.log

  • Wait until the pod is restored or delete the faulty pod.