vcgadd/vcgmax/vcgmin
功能说明
以block(32Byte)为单位完成向量的累加,每次迭代计算8个block,输出为每个block的累加结果,共8个值,连续写入dst地址(half类型占16Bytes,float类型占32Bytes)。
函数原型
void vcgadd(__ubuf__ half *dst, __ubuf__ half *src, uint8_t repeat, uint16_t dstRepeatStride, uint16_t src0Stride, uint16_t src1Stride); void vcgadd(__ubuf__ float *dst, __ubuf__ float *src, uint8_t repeat, uint16_t dstRepeatStride, uint16_t src0Stride, uint16_t src1Stride);
流水类型
PIPE_V
同类型接口
void vcgmax(__ubuf__ half *dst, __ubuf__ half *src, uint8_t repeat, uint16_t dstRepeatStride, uint16_t src0Stride, uint16_t src1Stride); void vcgmin(__ubuf__ half *dst, __ubuf__ half *src, uint8_t repeat, uint16_t dstRepeatStride, uint16_t src0Stride, uint16_t src1Stride);
父主题: 规约运算