vcgadd/vcgmax/vcgmin

Function Description

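Sums the elements of a vector block by block, where one block is 32 bytes. Each iteration processes 8 blocks and produces one sum per block, 8 values in total, which are written contiguously starting at the dst address (16 bytes for half, 32 bytes for float).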
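
The block-wise reduction described above can be modeled on the host as follows. This is only a sketch for reasoning about the output layout, not the device intrinsic: the helper name `block_sum_reference` is hypothetical, `float` is used because standard C++ has no half type, and the input and output are assumed to be contiguous (the stride parameters are ignored because this page does not define them).

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Host-side reference model only; this is NOT the device intrinsic.
constexpr std::size_t kBlockBytes = 32;      // one block, per the description
constexpr std::size_t kBlocksPerRepeat = 8;  // blocks consumed per iteration

// Hypothetical helper: sums each 32-byte block of `src` and returns one value
// per block. Assumes contiguous data; stride parameters are not modeled.
std::vector<float> block_sum_reference(const std::vector<float>& src,
                                       std::size_t repeat) {
    constexpr std::size_t elemsPerBlock = kBlockBytes / sizeof(float);  // 8 floats
    const std::size_t blocks = repeat * kBlocksPerRepeat;
    assert(src.size() >= blocks * elemsPerBlock);

    std::vector<float> dst(blocks, 0.0f);
    for (std::size_t b = 0; b < blocks; ++b) {
        float acc = 0.0f;
        for (std::size_t i = 0; i < elemsPerBlock; ++i) {
            acc += src[b * elemsPerBlock + i];
        }
        dst[b] = acc;  // one result per block, stored contiguously
    }
    return dst;
}
```

With half inputs, a block would hold 16 elements instead of 8, so one iteration would consume 128 values and still produce 8 results, which matches the 16-byte output size stated above.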

Function Prototype

void vcgadd(__ubuf__ half *dst, __ubuf__ half *src, uint8_t repeat, uint16_t dstRepeatStride, uint16_t src0Stride, uint16_t src1Stride); 
void vcgadd(__ubuf__ float *dst, __ubuf__ float *src, uint8_t repeat, uint16_t dstRepeatStride, uint16_t src0Stride, uint16_t src1Stride);

Pipeline Type

PIPE_V

Similar Interfaces

void vcgmax(__ubuf__ half *dst, __ubuf__ half *src, uint8_t repeat, uint16_t dstRepeatStride, uint16_t src0Stride, uint16_t src1Stride); 
void vcgmin(__ubuf__ half *dst, __ubuf__ half *src, uint8_t repeat, uint16_t dstRepeatStride, uint16_t src0Stride, uint16_t src1Stride);
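
This page does not spell out the semantics of vcgmax and vcgmin. Assuming they follow the same block-wise pattern as vcgadd, with each 32-byte block reduced to its maximum or minimum instead of its sum, a host-side model might look like the sketch below. The helper `block_extreme_reference` and the `BlockReduce` enum are hypothetical, `float` stands in for half, and the data is assumed contiguous with the stride parameters ignored.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Host-side reference model only; NOT the device intrinsics.
// Assumption: vcgmax/vcgmin reduce each 32-byte block to its max/min,
// mirroring how vcgadd reduces each block to its sum.
enum class BlockReduce { Max, Min };

std::vector<float> block_extreme_reference(const std::vector<float>& src,
                                           std::size_t repeat, BlockReduce op) {
    constexpr std::size_t elemsPerBlock = 32 / sizeof(float);  // 8 floats per block
    const std::size_t blocks = repeat * 8;                     // 8 blocks per iteration
    assert(src.size() >= blocks * elemsPerBlock);

    std::vector<float> dst(blocks);
    for (std::size_t b = 0; b < blocks; ++b) {
        auto first = src.begin() + static_cast<std::ptrdiff_t>(b * elemsPerBlock);
        auto last = first + static_cast<std::ptrdiff_t>(elemsPerBlock);
        dst[b] = (op == BlockReduce::Max) ? *std::max_element(first, last)
                                          : *std::min_element(first, last);
    }
    return dst;
}
```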