Multi-model Cascading Inference

The basic procedure of multi-model inference is similar to that of single-model inference. For details, see Model Inference.

The differences are as follows:

  • The model loading API needs to be called separately for each model in multi-model scenarios. For details about how to load a model, see Loading a Model.
  • If you need to run more models, call the model execution APIs separately for each model. For details about how to run a model, see Executing a Model.

    Call acl.mdl.execute to run synchronous model inference.