Tuning in TensorFlow-based Online Inference Scenarios
Both subgraph tuning and operator tuning are supported in TensorFlow-based online inference scenarios. The online tuning process here is basically the same as that in TensorFlow-based training scenarios. For details, see Online Tuning in TensorFlow-based Training Scenarios.
The differences between tuning in TensorFlow-based online inference scenarios and online tuning in TensorFlow-based training scenarios are as follows:
- During tuning by setting environment variables, the online inference script is executed for online inference, and the training script is executed for TensorFlow-based training.
- During tuning by modifying the script, in online inference scenarios, only the sess.run mode is supported, and session configuration options aoe_mode and work_path are used to enable AOE tuning. Only TensorFlow 1.15 and TensorFlow 2.6.5 are supported.
Parent topic: Tuning in Online Inference Scenarios