General Description

API Differences Between Versions

  • In this document, the media data processing APIs in the V1 and V2 versions have the same functions. Both of them are used to implement functions such as image cropping, image resizing, and format conversion. However, these two sets of APIs must not be mixed together.
    • V2 APIs are recommended, which guarantee continuous evolution of API functions and services in later versions.
    • V1 APIs are retained for backward compatibility considerations, but will be deprecated in later versions.

Function Description

Table 1 describes the functions of media data processing V1 (Digital Vision Pre-Processing, DVPP).

Table 1 Function description

Function

Description

Vision Preprocessing Core (VPC)

Processes images, including cropping, resizing, and format conversion. For details, see Function Description.

JPEG Decoder (JPEGD)

Decodes .jpg, .jpeg, .JPG, and .JPEG images into YUV images. For details, see Functions and Restrictions.

JPEG Encoder (JPEGE)

Encodes YUV images into .jpg images. For details, see Functions and Restrictions.

Video Decoder (VDEC)

Decodes videos. For details, see Functions and Restrictions.

Video Encoder (VENC)

Encodes videos. For details, see Functions and Restrictions.

PNG Decoder (PNGD)

Decodes PNG images. For details, see Functions and Restrictions.

Function Support

Table 2 describes the functions of media data processing V1 supported by products.

The meanings of the identifiers are as follows:
  • √: Supported
  • x: Not supported
Table 2 Function Support

Product

VPC

JPEGD

JPEGE

PNGD

VDEC

VENC

Atlas 200/300/500 Inference Product

Atlas Training Series Product

x

Restrictions

When using the APIs described in this chapter, pay attention to the following points:

  • About asynchronous APIs

    For the asynchronous APIs described in this chapter, a successful API call only indicates the success of the task delivery, regardless of the execution result. For dependent APIs, you are advised to specify the same stream for multiple APIs to ensure tasks execution sequence, because tasks in the same stream are executed in accordance with the API call sequence.

    When asynchronous APIs are called to decode, crop, and resize images, if tasks depend on each other, call acl.rt.synchronize_stream to ensure that tasks in the stream are executed in sequence.

    To ensure the performance, it is recommended that acl.rt.synchronize_stream be called once after multiple asynchronous media data processing tasks are delivered to a stream.

    After an asynchronous API is called, resources cannot be destroyed immediately. You need to call a synchronization API (for example, acl.rt.synchronize_stream) to confirm that the requested tasks of the device have been completed.

  • About memory allocation and deallocation

    1. If device memory is needed to store the input or output data before implementing the VPC, JPEGD, and JPEGE functions for media data processing, call acl.media.dvpp_malloc to allocate memory and acl.media.dvpp_free to free up memory.
    2. The memory allocated in 1 can be used for media data processing and other tasks. For example, the output of media data processing can be used as the input of model inference to implement memory reuse and reduce memory copy.
    3. Because the address space accessed by media data processing is limited, you are advised to call acl.rt.malloc, acl.rt.malloc_host, or acl.rt.malloc_cached described in section "Memory Management" to allocate memory for other functions (for example, model loading) to ensure sufficient memory during media data processing.
  • About channel requirements

    Before implementing each function of media data processing, you must call APIs to create corresponding channels. For details, see Channel Creation and Destruction. Channel creation and destruction involve resource allocation and release. Repeated channel creation and destruction, however, affect service performance. Therefore, you are advised to manage channels based on your actual scenario. For example, to process VPC images continuously, create VPC channels, wait until all VPC functions are called, and then destroy the VPC channels.

    A too large number of channels would affect the CPU usage and memory usage of the device. For details about the recommended number of channels, see the performance specifications in the corresponding function sections.