Introduction

Online inference refers to real-time inference using a trained .pb model in TensorFlow. Compared with offline inference, online inference has higher timeliness requirements and is usually used for data center inference.

By referring to this section, you will be able to easily port TensorFlow-based inference applications to the Ascend platform.

Parent topic: Online Inference