aclmdlLoadFromMemWithQ

Applicability

Product

Supported

Atlas A3 training products/Atlas A3 inference products

Atlas A2 training products/Atlas A2 inference products

Atlas 200I/500 A2 inference products

Atlas inference products

Atlas training products

Description

Loads an offline model (a model adapted to the Ascend AI Processor) from memory. The inputs and outputs of the model are passed through queues. This API supports only models with static-shape inputs.

Prototype

aclError aclmdlLoadFromMemWithQ(const void *model, size_t modelSize, uint32_t *modelId, const uint32_t *inputQ, size_t inputQNum, const uint32_t *outputQ, size_t outputQNum)

Parameters

| Parameter | Input/Output | Description |
|---|---|---|
| model | Input | Pointer to the memory that stores the model data. |
| modelSize | Input | Size of the model data in the memory, in bytes. |
| modelId | Output | Pointer to the model ID. The model ID is returned after the model is successfully loaded and identifies the model in subsequent operations. |
| inputQ | Input | Pointer to the input queue IDs. Each model input corresponds to one queue ID. |
| inputQNum | Input | Number of input queues. |
| outputQ | Input | Pointer to the output queue IDs. Each model output corresponds to one queue ID. |
| outputQNum | Input | Number of output queues. |

Returns

Returns 0 on success; any other value indicates failure. For details, see aclError.
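The calling pattern implied by the prototype and the return convention can be sketched as follows. This is only an illustrative sketch: load_model_with_queues, load_with_q_fn, sketch_acl_error, and fake_loader are hypothetical names that are not part of AscendCL, and the argument checks (for example, rejecting a queue count of 0) are assumptions of this sketch rather than documented behavior. The loader is passed as a function pointer so that the block compiles without the AscendCL headers; in a real application, aclmdlLoadFromMemWithQ itself would presumably be passed in place of fake_loader.

```c
#include <stddef.h>
#include <stdint.h>
#include <stdio.h>

/* Stand-in for aclError so this sketch compiles on its own (assumption:
 * with the real AscendCL headers, use aclError instead). */
typedef int sketch_acl_error;

/* Function-pointer type mirroring the documented prototype. */
typedef sketch_acl_error (*load_with_q_fn)(const void *model, size_t modelSize,
                                           uint32_t *modelId,
                                           const uint32_t *inputQ, size_t inputQNum,
                                           const uint32_t *outputQ, size_t outputQNum);

/* Hypothetical stub used in place of the real API so the sketch can run
 * without an Ascend device. It always reports success with model ID 1. */
static sketch_acl_error fake_loader(const void *model, size_t modelSize,
                                    uint32_t *modelId,
                                    const uint32_t *inputQ, size_t inputQNum,
                                    const uint32_t *outputQ, size_t outputQNum)
{
    (void)model; (void)modelSize;
    (void)inputQ; (void)inputQNum;
    (void)outputQ; (void)outputQNum;
    *modelId = 1;
    return 0;
}

/* Hypothetical wrapper: checks the arguments locally, calls the loader,
 * and maps the result to 0 (success) or -1 (failure). */
int load_model_with_queues(load_with_q_fn loader,
                           const void *model, size_t modelSize,
                           const uint32_t *inputQ, size_t inputQNum,
                           const uint32_t *outputQ, size_t outputQNum,
                           uint32_t *modelId)
{
    /* Local sanity checks; these are assumptions of the sketch. */
    if (loader == NULL || model == NULL || modelSize == 0 || modelId == NULL ||
        inputQ == NULL || inputQNum == 0 || outputQ == NULL || outputQNum == 0) {
        fprintf(stderr, "load_model_with_queues: invalid argument\n");
        return -1;
    }

    sketch_acl_error ret = loader(model, modelSize, modelId,
                                  inputQ, inputQNum, outputQ, outputQNum);
    if (ret != 0) {
        /* Any nonzero value indicates failure. */
        fprintf(stderr, "model load failed, error code %d\n", ret);
        return -1;
    }
    return 0;
}
```

On success, the value written to modelId would be kept and used to identify the model in later operations, all performed in the same context as the load.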

Restrictions

The operations of loading, executing, and unloading a model must be performed in the same context. For details about how to create a context, see the context management API reference.

Before loading the model file, check whether the available memory is sufficient for the size of the model file. If the memory is insufficient, the application will run abnormally.
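One way to prepare the model and modelSize arguments while checking that the buffer can actually be allocated is sketched below. This is a minimal sketch under stated assumptions: read_model_file is a hypothetical helper, not an AscendCL function, and it places the model data in ordinary host memory obtained with malloc. Whether the API expects host or device memory depends on the deployment form, so the allocation call here is an assumption and may need to be replaced.

```c
#include <stdio.h>
#include <stdlib.h>

/* Hypothetical helper: reads a whole model file into a newly allocated
 * buffer. Returns the buffer and stores its size in *size_out, or returns
 * NULL if the file cannot be read or the memory cannot be allocated.
 * The caller releases the buffer with free(). */
void *read_model_file(const char *path, size_t *size_out)
{
    FILE *fp = fopen(path, "rb");
    if (fp == NULL) {
        return NULL;
    }

    /* Determine the file size so the buffer can be sized up front. */
    if (fseek(fp, 0, SEEK_END) != 0) {
        fclose(fp);
        return NULL;
    }
    long len = ftell(fp);
    if (len <= 0) {
        fclose(fp);
        return NULL;
    }
    rewind(fp);

    /* Fail early if there is not enough memory for the model data. */
    void *buf = malloc((size_t)len);
    if (buf == NULL) {
        fclose(fp);
        return NULL;
    }

    if (fread(buf, 1, (size_t)len, fp) != (size_t)len) {
        free(buf);
        fclose(fp);
        return NULL;
    }

    fclose(fp);
    *size_out = (size_t)len;
    return buf;
}
```

The returned pointer and size would then be passed as model and modelSize. A NULL result gives the application a chance to stop cleanly instead of attempting the load with insufficient memory.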

See Also

The aclmdlSetConfigOpt and aclmdlLoadWithConfig APIs are also provided for model loading. With these APIs, the caller sets attributes in the configuration object passed to the call to decide how the model is loaded and who manages the memory.