Overview
This section describes the command-line options used by the AOE. An option and its argument can be separated by an equal sign (=) or a space. In this section, the equal sign (=) is used as an example.
If options queried with the aoe --help command are not described in Table 1, they are reserved or applicable to other SoC versions. You do not need to pay attention to such options.
Option |
Description |
Mandatory (Yes/No) |
Default Value |
|---|---|---|---|
Displays help information. |
No |
N/A |
|
Sets the model file directory, including the file name. |
No |
N/A |
|
Sets the model file directory, which can store multiple model files. |
No |
N/A |
|
Sets the weight file directory, including the file name. |
No |
N/A |
|
Sets the tuning mode. |
Yes |
N/A |
|
Sets the framework of the original model. |
No |
N/A |
|
Sets the input data format. |
No |
NCHW (Caffe and ONNX) NHWC (TensorFlow) |
|
Sets the shape of each input. |
No |
N/A |
|
Sets dynamic batch size profiles. Applies to the scenario where image count per inference batch is unfixed. |
No |
N/A |
|
Sets dynamic image size profiles. Applies to the scenario where the resolution of images input for inference is unfixed. |
No |
N/A |
|
Sets dynamic dimension profiles in ND format. Applies to the scenario where the dimensions for inference are unfixed. |
No |
N/A |
|
Reloads tuning after subgraph tuning is interrupted. After the current process is interrupted, if you want to continue tuning from the previous phase, run --reload to enter the reload mode. |
No |
N/A |
|
Specifies the device used for tuning in the operating environment. |
No |
N/A |
|
Enables or disables the function of displaying the tuning progress. |
No |
on |
|
Tunes one or more specified operators by configuring the operator description file. |
No |
N/A |
|
Sets the path of the tuned model, including the file name. |
No |
N/A |
|
Sets the output data type of a network or an output node. |
No |
N/A |
|
If the OS and architecture of the model compilation environment are inconsistent with those of the model operating environment, set this option to the OS type of the model operating environment. |
No |
N/A |
|
If the OS and its architecture of the model compilation environment are inconsistent with those of the model operating environment, set this option to the OS architecture of the model operating environment. |
No |
N/A |
|
Sets the number of AI Cores for model compilation. |
No |
The default value is the actual number of cores of the Ascend AI Processor. |
|
Indicates whether AOE can run on virtual devices generated on Ascend virtual instances. Availability:
|
No |
0 |
|
Sets the output nodes. |
No |
N/A |
|
Sets the input nodes to specify as FP16 nodes. |
No |
N/A |
|
Sets the path of the insertion operator configuration file, including the file name. |
No |
N/A |
|
Sets the path of the custom operator (non-standard operators) mapping configuration file, including the file name. |
No |
N/A |
|
Sets the data type and format of the network inputs to FP16 and NC1HWC0, respectively. |
No |
false |
|
Sets the data type and format of the network outputs to FP16 and NC1HWC0, respectively. |
No |
false |
|
Enables memory reuse. |
No |
0 |
|
Sets the fusion switch configuration file directory, including the file name. |
No |
N/A |
|
Enables specific fusion patterns during compilation. |
No |
N/A |
|
Sets whether to enable single-stream serial execution of model inference in the static shape scenario. Streams preserve the order of a stack of asynchronous operations being executed on the device. |
No |
false |
|
Enables small channel tuning to yield performance benefits at convolutional layers with channel size ≤ 4. |
No |
0 |
|
Sets the directory of the node list configuration file to be compressed, including the file name. |
No |
N/A |
|
Sets the directory (including the file name) of the compression configuration file. This option is used to enable the compression optimization feature specified in the configuration file to improve network performance. |
No |
N/A |
|
Enables buffer tuning. |
No |
l2_optimize |
|
Sets the precision mode of a model. |
No |
The default arguments are as follows:
|
|
Selects the operator implementation mode. |
No |
high_performance |
|
Lists operator optypes. |
No |
N/A |
|
Enables TBE operator debug during operator compilation. |
No |
0 |
|
Sets the log level during tuning. |
No |
N/A |
|
Specifies the operator name or operator type in the configuration file to tune a specified operator. |
No |
N/A |
|
Sets the precision mode of an operator. You can use this option to set different precision modes for different operators. |
No |
N/A |
|
Sets the operators on the mixed precision list. |
No |
N/A |
|
Keeps the computation precision of some operators unchanged during the building of the original network model. |
No |
N/A |
|
Customizes the computing precision of one or more operators during model building. |
No |
N/A |
|
Sets the tuning mode, including the high-performance mode and normal mode. |
No |
O2 |
|
Sets in-depth operator tuning. |
No |
N/A |
|
Sets non-uniform subgraph partition tuning. |
No |
N/A |
|
Sets operator Format tuning. |
No |
N/A |
|
Enables global sparsity. |
No |
0 |
|
Enables static kernel tuning, which generates the tuned kernel based on the input operator .json file and saves the kernel to a specified directory. |
No |
N/A |
|
Specifies the path for storing the operator .json file |
No |
N/A |
|
Specifies the path for storing the static kernel. |
No |
N/A |
|
Specifies the version of the Ascend AI Processor. |
No |
N/A |
|
Transparently transmits compilation options that are not detectable by the AOE tuning framework and tuning services in the modeling initialization phase. |
No |
N/A |
|
Transparently transmits compilation options that are not detectable by the AOE tuning framework and tuning services in the model compilation phase. |
No |
N/A |