INT8Flat
INT8Flat and SQ8 differ in where quantization takes place. With INT8Flat, features are quantized externally, so the input features of the index are of the INT8 type. With SQ8, features are quantized inside the index, so the input features of the index are of the Float32 type.
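To illustrate what "quantized externally" can mean in practice, the following is a minimal sketch of converting Float32-style features to INT8 before they reach the index. The symmetric scaling scheme, the function name `quantize_to_int8`, and the sample values are assumptions made for illustration; the source does not specify which quantization method should be used.

```python
from array import array


def quantize_to_int8(vector, scale):
    """Scale, round, and clamp each float to the signed 8-bit range."""
    codes = array("b")  # "b" = signed char, i.e. one byte per element
    for x in vector:
        q = round(x * scale)
        codes.append(max(-128, min(127, q)))
    return codes


# Hypothetical feature vector; real features would have <dim> elements.
features = [0.4, -0.25, 1.0, -1.0]

# Assumed symmetric scheme: map the largest magnitude to 127.
scale = 127 / max(abs(x) for x in features)

int8_features = quantize_to_int8(features, scale)
print(list(int8_features))
```

With SQ8, by contrast, a step of this kind would happen inside the index, and the caller would pass the float features directly.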
| Usage | `python3 int8flat_generate_model.py -d <dim> --cores <core_num> -p <process_id> -pool <pool_size> -t <npu_type> -code <code_num>` |
|---|---|
| Parameter | `<dim>`: feature vector dimension. The default value is 512.<br>`<core_num>`: number of AI Cores of the Ascend AI Processor. The default value is 2. You do not need to configure this parameter.<br>`<process_id>`: ID of the process for multi-process scheduling of operators generated in batches. The default value is 0, and you do not need to set this parameter.<br>`<pool_size>`: size of the process pool for multi-process scheduling of operators generated in batches. The default value is 10.<br>`<npu_type>`: hardware form.<br>`<code_num>`: database block size when the operator is called. The default value is 262144. If this parameter is not set, operators with code_num values are generated by default.<br>`--help` or `-h`: help information. |
| Description | Run the command to obtain a group of operator model files. You need to modify the parameters in the command. |
| Restrictions | |
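Because the parameters in the command must be adapted before it is run, it can help to assemble the argument list programmatically. The sketch below only builds and prints the command; it uses the flags listed in the table and nothing else. The helper name `build_int8flat_command` is hypothetical, and the `<npu_type>` value is left as a placeholder because the source does not list the valid hardware forms.

```python
import shlex


def build_int8flat_command(dim=512, npu_type=None, pool_size=None,
                           code_num=None,
                           script="int8flat_generate_model.py"):
    """Assemble the argv list for the operator-generation script.

    --cores and -p are omitted on purpose: the table states that they
    do not need to be configured.
    """
    argv = ["python3", script, "-d", str(dim)]
    if pool_size is not None:
        argv += ["-pool", str(pool_size)]
    if npu_type is not None:
        argv += ["-t", str(npu_type)]
    if code_num is not None:
        argv += ["-code", str(code_num)]
    return argv


# "<npu_type>" is a placeholder; replace it with the value for your hardware.
cmd = build_int8flat_command(dim=256, npu_type="<npu_type>", code_num=262144)
print(shlex.join(cmd))
```

Once the placeholder has been replaced, the list could be passed to `subprocess.run(cmd, check=True)` from the directory that contains the script.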