SetHF32

Applicability

Product	Supported
Atlas A3 training products/Atlas A3 inference products	√
Atlas A2 training products/Atlas A2 inference products	√
Atlas 200I/500 A2 inference products	x
Atlas inference product's AI Core	x
Atlas inference product's Vector Core	x
Atlas training products	x

Function

Sets whether to enable the HF32 mode (data type that can be used for matrix multiplication) in Cube-only mode (only matrix computation). After the mode is enabled, the float32 data type is converted to the hf32 data type during matrix multiplication, which improves the computing performance but causes precision loss.

Prototype

__aicore__ inline void SetHF32(bool enableHF32 = false, int32_t transMode = 0)

Parameters

Parameter	Input/Output	Description
enableHF32	Input	Whether to enable the HF32 mode. The default value is false, indicating that the HF32 mode is disabled.
transMode	Input	ROUND mode used for converting float to hf32 when the HF32 mode is enabled. The default value is 0. 0: rounding to the nearest even number when the distance is equal. 1: rounding to the nearest number away from zero when the distance is equal.

Parameter

Input/Output

Description

enableHF32

Input

Whether to enable the HF32 mode. The default value is false, indicating that the HF32 mode is disabled.

transMode

Input

ROUND mode used for converting float to hf32 when the HF32 mode is enabled. The default value is 0.

0: rounding to the nearest even number when the distance is equal.

1: rounding to the nearest number away from zero when the distance is equal.

Returns

None

Restrictions

This API can be called only in Cube-only mode.

Example

// Cube-only mode
#define ASCENDC_CUBE_ONLY
REGIST_MATMUL_OBJ(&pipe, GetSysWorkSpacePtr(), mm, &tiling);    // A/B/C/BIAS is of the float type.
mm.SetTensorA(gm_a);
mm.SetTensorB(gm_b);
mm.SetBias(gm_bias);
mm.SetHF32(true);
mm.IterateAll(gm_c);
mm.SetHF32(false);

Parent topic: Matmul Kernel APIs