FlatAT
Currently, the FlatAT operator is used together with the IVF operator to accelerate the add and train functions of the IVF operator. The FlatAT operator cannot be directly called. The acceleration of add and train is specified by AscendIndexIVFConfig.useKmeansPP in the IVF. In this case, only the training job whose scale is smaller than 7,000,000 is supported.
Usage |
python3 flat_at_generate_model.py --cores <core_num> -d <dim> -c <code_num> -p <process_id> -t <npu_type> |
|---|---|
Parameter |
<dim>: input feature vector dimension. The default value is 64. <code_num>: number of base library features to be compared with the input features. The default value is 8192. <core_num>: number of AI Cores of the Ascend AI Processor. The default value is 2. If this parameter is not specified, set this parameter based on <npu_type>. When npu_type is set to 310, set <core_num> to 2. When npu_type is set to 310P, set <core_num> to 8. <process_id>: ID of the process for multi-process scheduling of operators generated in batches. The default value is 0, and you do not need to set this parameter. <npu_type>: hardware form. Currently, --help | -h: help information. |
Description |
Run the command to obtain a group of operator model files. The FlatAT operator is used in the IVF scenario to reduce the time required for train and add functions. |
Restrictions |
|