INT8
The differences between INT8 and SQ8 are as follows: INT8 is quantized externally, and the input feature of the index is of the int8 type. SQ8 is quantized internally, and the input feature of the index is of the float32 type.
Usage |
python3 int8flat_generate_model.py -d <dim> --cores <core_num> -p <process_id> -pool <pool_size> |
|---|---|
Parameter |
<dim>: feature vector dimension (D). <core_num>: number of AI Cores of the Ascend AI Processor. The default value is 2. <process_id>: ID of the process for batch generating operator multi-process scheduling tasks. <pool_size>: size of the process pool for multi-process scheduling of operators generated in batches. |
Description |
Run the command to obtain a group of operator model files. You need to modify the parameters in the command. |
Restrictions |
|