Resource Monitoring

Function Highlights

Ascend AI processor resources can be monitored in real time while training or inference jobs run. The monitored items include usage, temperature, voltage, memory, and allocation status in containers. For vNPUs, the AI Core usage, total memory, and used memory can also be monitored. Currently, NPU Exporter can monitor vNPU resources only on Atlas inference products.
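As a rough illustration of how such monitoring data might be consumed, the sketch below parses metrics text and lists devices whose usage exceeds a threshold. It assumes the metrics are available in the Prometheus text exposition format. The metric names (`example_npu_utilization` and so on), the `id` label, the sample values, and the helper names `parse_metrics` and `busy_devices` are all hypothetical and are not taken from NPU Exporter; substitute the names your deployment actually reports.

```python
import re

# Hypothetical sample in Prometheus text format. Metric names, labels, and
# values are illustrative only, not confirmed NPU Exporter output.
SAMPLE_METRICS = """\
# HELP example_npu_utilization Illustrative AI Core usage in percent
# TYPE example_npu_utilization gauge
example_npu_utilization{id="0"} 37
example_npu_temperature{id="0"} 52
example_npu_used_memory{id="0"} 2048
example_npu_utilization{id="1"} 91
"""

# metric_name{labels} value [timestamp]
_LINE = re.compile(
    r'^([A-Za-z_:][A-Za-z0-9_:]*)(?:\{(.*)\})?\s+(\S+)(?:\s+-?\d+)?$'
)
_LABEL = re.compile(r'([A-Za-z_][A-Za-z0-9_]*)="([^"]*)"')


def parse_metrics(text):
    """Return a list of (name, labels, value) tuples from metrics text."""
    samples = []
    for line in text.splitlines():
        line = line.strip()
        # Skip blank lines and HELP/TYPE comment lines.
        if not line or line.startswith("#"):
            continue
        match = _LINE.match(line)
        if match is None:
            continue  # Ignore lines this simple parser does not understand.
        name, raw_labels, value = match.groups()
        labels = dict(_LABEL.findall(raw_labels or ""))
        samples.append((name, labels, float(value)))
    return samples


def busy_devices(samples, metric, threshold):
    """Return sorted device ids whose `metric` value is >= threshold."""
    return sorted(
        labels.get("id", "?")
        for name, labels, value in samples
        if name == metric and value >= threshold
    )


if __name__ == "__main__":
    parsed = parse_metrics(SAMPLE_METRICS)
    print(busy_devices(parsed, "example_npu_utilization", 80))
```

To apply the same logic to a live deployment, the sample string could be replaced with text fetched from the exporter's metrics endpoint, whose address depends on how the component was deployed.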

Required Component

NPU Exporter

Instructions

  1. Install the component as described in Installation and Deployment.
  2. Use the feature as described in Before You Start.