Multi-model Cascading Inference
The basic procedure of multi-model inference is similar to that of single-model inference as described in Model Inference.
The differences are as follows:
- If you need to load more models, call the model loading API separately for each model. For details about how to load a model, see Loading a Model.
- If you need to run more models, call the model execution API separately for each model. For details about how to run a model, see Running a Model.
Parent topic: Model Inference