Introduction

Online inference is real-time inference that runs a trained .pb model in TensorFlow. Compared with offline inference, it has tighter latency requirements and is typically used for data center inference.

This chapter describes how to port TensorFlow-based inference applications to the Ascend platform.