Tuning in TensorFlow-based Online Inference Scenarios

For the Atlas A2 training products/Atlas A2 inference products, tuning in online inference is not supported.

For the Atlas A3 training products/Atlas A3 inference products, tuning in online inference is not supported.

Both subgraph tuning and operator tuning are supported in TensorFlow-based online inference scenarios. The online tuning process here is basically the same as that in TensorFlow-based training scenarios. For details, see Online Tuning in TensorFlow-based Training Scenarios.

The differences between tuning in TensorFlow-based online inference scenarios and online tuning in TensorFlow-based training scenarios are as follows:

  • During tuning by setting environment variables, the online inference script is executed for online inference, and the training script is executed for TensorFlow-based training.
  • During tuning by modifying the script, in online inference scenarios, only the sess.run mode is supported, and session configuration options aoe_mode and work_path are used to enable AOE tuning. Only TensorFlow 1.15 and TensorFlow 2.6.5 are supported.