WaitGetTensorC
Function Description
When the GetTensorC asynchronous API is used to copy the result matrix from GM to UB and UB needs to perform vector computation later, WaitGetTensorC needs to be called for synchronization.
Prototype
1 | __aicore__ inline void WaitGetTensorC() |
Parameters
None
Returns
None
Availability
Precautions
None
Example
1 2 3 4 5 6 7 8 | // Asynchronous mode mm.template Iterate<false>(); ...... ...... for (int i = 0; i < singleM/baseM*singleN/baseN; ++i) { mm.GetTensorC<false>(ubCmatrix); mm.WaitGetTensorC(); ... Vector operations } |
Parent topic: Matmul