Quantization Configuration Parameters

| Description | QAT enable |
|---|---|
| Type | Boolean |
| Value | true or false |
| Command-Line Options | |
| Recommended Value | true |
| Required/Optional | Optional |

| Description | Activation quantization configuration |
|---|---|
| Type | Dictionary |
| Value | - |
| Command-Line Options | Includes the following parameters: |
| Recommended Value | - |
| Required/Optional | Optional |

| Description | Weight quantization configuration |
|---|---|
| Type | Dictionary |
| Value | - |
| Command-Line Options | Includes the following parameters: |
| Recommended Value | - |
| Required/Optional | Optional |

| Description | Weight quantization algorithm |
|---|---|
| Type | String |
| Value | - |
| Command-Line Options | |
| Recommended Value | arq_retrain |
| Required/Optional | Optional |

| Description | Whether to use different quantization factors for each channel. |
|---|---|
| Type | Boolean |
| Value | true or false |
| Command-Line Options | |
| Recommended Value | true |
| Required/Optional | Optional |

| Description | Number of batches used for quantization |
|---|---|
| Type | Integer |
| Value | Greater than 0 |
| Command-Line Options | Defaults to 1. You are advised to keep the calibration dataset size within 50 images. Calculate batch_num from batch_size as follows: batch_num × batch_size = calibration dataset size, where batch_size is the number of images per batch. |
| Recommended Value | 1 |
| Required/Optional | Optional |
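
The relation batch_num × batch_size = calibration dataset size can be checked with a few lines of Python. This is a minimal sketch, not part of any tool: the helper name `compute_batch_num` is hypothetical, and requiring an exact multiple is an assumption made here so that the relation holds as an equality.

```python
# Advisory limit on the calibration dataset size, taken from the table above.
MAX_ADVISED_IMAGES = 50


def compute_batch_num(dataset_size: int, batch_size: int) -> int:
    """Return batch_num such that batch_num * batch_size == dataset_size.

    Hypothetical helper for illustration only.
    """
    if dataset_size <= 0 or batch_size <= 0:
        raise ValueError("dataset_size and batch_size must be greater than 0")
    if dataset_size % batch_size != 0:
        # Assumption: insist on an exact multiple so the equality holds.
        raise ValueError("dataset_size must be a multiple of batch_size")
    return dataset_size // batch_size


def exceeds_advised_size(dataset_size: int) -> bool:
    """True if the dataset is larger than the advised 50 images."""
    return dataset_size > MAX_ADVISED_IMAGES


# Example: 48 calibration images, 16 images per batch.
batch_num = compute_batch_num(dataset_size=48, batch_size=16)
print(batch_num, exceeds_advised_size(48))
```

With 48 images and 16 images per batch, the helper yields a batch_num of 3, and the dataset stays within the advised size.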

| Description | Fixed lower bound switch for the activation quantization algorithm |
|---|---|
| Type | Boolean |
| Value | true or false |
| Command-Line Options | |
| Recommended Value | Do not include this parameter. |
| Required/Optional | Optional |

| Description | Upper bound for the activation quantization algorithm |
|---|---|
| Type | Float |
| Value | clip_max > 0. Find the maximum value, max, from the distribution of the activation values at each layer. The recommended value range is [0.3 × max, 1.7 × max]. |
| Command-Line Options | If this parameter is included, the upper bound of the clipping-based activation quantization algorithm is fixed. If this parameter is not included, the upper bound is learned using the IFMR algorithm. |
| Recommended Value | Do not include this parameter. |
| Required/Optional | Optional |

| Description | Lower bound for the activation quantization algorithm |
|---|---|
| Type | Float |
| Value | clip_min < 0. Find the minimum value, min, from the distribution of the activation values at each layer. The recommended value range is [0.3 × min, 1.7 × min]. |
| Command-Line Options | If this parameter is included, the lower bound of the clipping-based activation quantization algorithm is fixed. If this parameter is not included, the lower bound is learned using the IFMR algorithm. |
| Recommended Value | Do not include this parameter. |
| Required/Optional | Optional |
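
If you do decide to fix the bounds rather than leave them to be learned, the recommended ranges for clip_max and clip_min can be derived from observed activation values. The sketch below is illustrative only: the function name `recommended_clip_ranges` and the flat list of activation samples are assumptions, not part of any tool. Because min is negative, the clip_min range is returned with its more negative end first.

```python
def recommended_clip_ranges(activations):
    """Derive the recommended clip_max / clip_min ranges for one layer.

    Hypothetical helper: `activations` is any iterable of floats sampled
    from a single layer's output.
    """
    values = list(activations)
    if not values:
        raise ValueError("activations must not be empty")

    act_max = max(values)
    act_min = min(values)
    ranges = {}

    # clip_max must be > 0, so only report a range for a positive maximum.
    if act_max > 0:
        ranges["clip_max"] = (0.3 * act_max, 1.7 * act_max)

    # clip_min must be < 0, so only report a range for a negative minimum.
    if act_min < 0:
        low, high = sorted((0.3 * act_min, 1.7 * act_min))
        ranges["clip_min"] = (low, high)

    return ranges


# Example with made-up activation samples from one layer.
samples = [-4.0, -1.5, 0.0, 0.5, 2.0]
for name, (low, high) in recommended_clip_ranges(samples).items():
    print(f"{name}: choose a value between {low:.2f} and {high:.2f}")
```

A key is omitted from the result when the layer has no activation of the required sign, since no valid fixed bound exists in that case.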

| Description | Quantization bit width selection |
|---|---|
| Type | String |
| Value | Currently, only INT8 (default) is supported. |
| Command-Line Options | Selects the bit width used during quantization. |
| Recommended Value | - |
| Required/Optional | Optional |
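
The type and value constraints stated in the tables for batch_num, clip_max, and clip_min can be checked before a configuration is used. The following is a minimal sketch under stated assumptions: `validate_params` is a hypothetical helper, and the parameters are assumed to arrive as a plain Python dictionary. All three parameters are optional, so missing keys are not reported.

```python
def _is_number(value, allow_float=True):
    """True for int (and optionally float), excluding bool."""
    if isinstance(value, bool):
        return False  # bool is a subclass of int in Python; reject it
    allowed = (int, float) if allow_float else (int,)
    return isinstance(value, allowed)


def validate_params(params: dict) -> list:
    """Return a list of constraint violations (empty if none).

    Hypothetical helper; checks only the three parameters that the
    tables name explicitly.
    """
    errors = []

    if "batch_num" in params:
        value = params["batch_num"]
        if not _is_number(value, allow_float=False) or value <= 0:
            errors.append("batch_num must be an integer greater than 0")

    if "clip_max" in params:
        value = params["clip_max"]
        if not _is_number(value) or value <= 0:
            errors.append("clip_max must be a number greater than 0")

    if "clip_min" in params:
        value = params["clip_min"]
        if not _is_number(value) or value >= 0:
            errors.append("clip_min must be a number less than 0")

    return errors


# Example: clip_min has the wrong sign, so one violation is reported.
print(validate_params({"batch_num": 1, "clip_max": 6.0, "clip_min": 0.5}))
```

Returning a list of messages instead of raising on the first problem lets a caller report every violation at once.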