Introduction
Online inference is real-time inference that runs a trained model in .pb format directly in TensorFlow. Compared with offline inference, online inference has stricter latency requirements and is typically used for inference in data centers.
This chapter describes how to port TensorFlow-based inference applications to the Ascend platform.