Case Description
A certain multimodal model experienced a sudden significant performance degradation during training. We will perform performance tuning based on the process described above.
Use Ascend PyTorch Profiler to collect profile data during LLM training. This case involves a cluster with 16 cards.
Parent topic: Performance Tuning Cases