Fusion Support
Currently, this tool mainly implements the following forms of BN fusion (single-operators involved in the following fusion forms must meet the restrictions of Quantization):
- Conv+BN+Scale fusion: Before AMCT-based quantization, the "Convolution+BatchNorm+Scale" composite in the model is fused into "Conv+BN+Scale." The BatchNorm and Scale layers are removed.
- Deconv+BN+Scale fusion: Before AMCT-based quantization, the "Deconvolution+BatchNorm+Scale" composite in the model is fused into "Deconv+BN+Scale." The BatchNorm and Scale layers are removed.
- BN+Scale+Conv fusion: Supported only after PTQ. Before quantization, BN+Scale+Conv fusion is performed on the BatchNorm+Scale+Conv structure in the model. The BatchNorm and Scale layers after fusion are deleted.
- FC+BN+Scale fusion: This function is supported only after quantization. Before quantization, FC+BN+Scale fusion is performed on the InnerProduct+BatchNorm+Scale structure in the model. After the fusion, the BatchNorm and Scale layers are deleted.
Parent topic: See Also