Overview
In ONNX scenarios, only accuracy comparison of non-quantized models is supported. The following table lists the input data to be prepared.
File |
Description |
How to Obtain |
|---|---|---|
.npy file of the non-quantized original model |
Benchmark data |
|
.json file generated by converting the offline model file using ATC |
Operator mapping obtaining |
|
Dump data file of the non-quantized offline model running on the Ascend AI Processor |
Data to be compared |
In the offline inference scenario, the methods for obtaining the dump data of the NPU environment are the same for different frameworks. For details, see the following: |
Parent topic: Comparison Between GPU and NPU (ONNX Offline Inference)
