INT8

The differences between INT8 and SQ8 are as follows: INT8 is quantized externally, and the input feature of the index is of the int8 type. SQ8 is quantized internally, and the input feature of the index is of the float32 type.

Usage

python3 int8flat_generate_model.py -d <dim> --cores <core_num> -p <process_id> -pool <pool_size>

Parameter

<dim>: feature vector dimension (D).

<core_num>: number of AI Cores of the Ascend AI Processor. The default value is 2.

<process_id>: ID of the process for batch generating operator multi-process scheduling tasks.

<pool_size>: size of the process pool for multi-process scheduling of operators generated in batches.

Description

Run the command to obtain a group of operator model files. You need to modify the parameters in the command.

Restrictions

  • dim ∈ {64, 128, 256, 384, 512, 1024}
  • 0 ≤ pool_size ≤ 32