Restrictions
The concepts of 8K and non-8K in this section are described as follows:
- Non-8K: Width x Height is in the range of [10 x 6, 4096 x 4096].
- 8K: The width or height is in the range of (4096, 8192].
Resolution Restrictions
- Input image resolution. Input images with the following resolutions are supported:
The APIs for implementing VPC functions have different resolution requirements. For details, see the API descriptions in VPC. When the input image format is YUV440SP or YUV440P, the maximum width of the image is 4096.
- Output resolution range:
Version
Resolution Range
Atlas 200/300/500 Inference Product
Atlas Training Series Product
10 * 6 ~ 4096 * 4096
Restrictions on Image Formats, Width and Height Alignment, and Buffers
During image processing by VPC, acl.media.dvpp_malloc and acl.media.dvpp_free are called to allocate and free the input and output buffers on the device. The buffer lifetime is managed by the user.
- For details about the restrictions on the input image formats and width and height alignment, see Table 1.
- For details about the restrictions on the output image formats and width and height alignment, see Table 2.
For the Atlas 200/300/500 Inference Product, in the 8K scenario, the input image format must be YUV420SP NV12 8-bit or YUV420SP NV21 8-bit.
For the Atlas Training Series Product, in the 8K scenario, the input image format must be YUV420SP NV12 8-bit or YUV420SP NV21 8-bit.
To implement VPC functions with related APIs:
- For details about the definition of the image format, see acldvppPixelFormat. For details about the concepts such as width stride, height stride, and 8K see Terminology.
- The minimum and maximum values of the width stride and height stride vary according to the version. For details, see the following:
- Atlas 200/300/500 Inference Product
- The minimum width stride is 32, and the maximum width stride is 16384 (16384 = 4096 x 4, for the ARGB format, each pixel occupies 4 bytes).
- The minimum height stride is 6 and the maximum height stride is 16384.
- Atlas Training Series Product
- The minimum width stride is 32, and the maximum width stride is 16384 (16384 = 4096 x 4, for the ARGB format, each pixel occupies 4 bytes).
- The minimum height stride is 6 and the maximum height stride is 16384.
- Atlas 200/300/500 Inference Product
|
Version |
Input Format |
Input Width and Height |
Input Width Stride, Height Stride, and Buffer Size Atlas 200/300/500 Inference Product Atlas Training Series Product |
|---|---|---|---|
|
General format, supported by all versions |
YUV400 8-bit |
No alignment requirement. |
Width stride: Round up the width to the nearest multiple of 16. Height stride: Round up the height to the nearest multiple of 2. Buffer size (in bytes) = Width stride x Height stride x 3/2 |
|
YUV420SP NV12 8-bit |
Width: Must be a multiple of 2. Height: Must be a multiple of 2. |
Width stride: Round up the width to the nearest multiple of 16. Height stride: Round up the height to the nearest multiple of 2. Buffer size (in bytes) = Width stride x Height stride x 3/2 |
|
|
YUV420SP NV21 8-bit |
|||
|
YUV422SP 8-bit |
Width: Must be a multiple of 2. Height: No alignment requirement. |
Width stride: Round up the width to the nearest multiple of 16. Height stride: Round up the height to the nearest multiple of 2. Buffer size (in bytes) = Width stride x Height stride x 2 |
|
|
YVU422SP 8-bit |
|||
|
YUV444SP 8-bit |
No alignment requirement. |
Width stride: Round up the width to the nearest multiple of 16. Height stride: Round up the height to the nearest multiple of 2. Buffer size (in bytes) = Width stride x Height stride x 3 |
|
|
YVU444SP 8-bit |
|||
|
YUV422 Packed YUYV 8-bit |
Width: Must be a multiple of 2. Height: No alignment requirement. |
Width stride: Round up the width to the nearest multiple of 16 and multiply the result by 2. Height stride: Round up the height to the nearest multiple of 2. Buffer size (in bytes) = Width stride x Height stride |
|
|
YUV422 Packed UYVY 8-bit |
|||
|
YUV422 Packed YVYU 8-bit |
|||
|
YUV422 Packed VYUY 8-bit |
|||
|
YUV444 Packed 8-bit |
No alignment requirement. |
Width stride: Round up the width to the nearest multiple of 16 and multiply the result by 3. Height stride: Round up the height to the nearest multiple of 2. Buffer size (in bytes) = Width stride x Height stride |
|
|
RGB888 |
|||
|
BGR888 |
|||
|
ARGB8888 |
No alignment requirement. |
Width stride: Round up the width to the nearest multiple of 16 and multiply the result by 4. Height stride: Round up the height to the nearest multiple of 2. Buffer size (in bytes) = Width stride x Height stride |
|
|
ABGR8888 |
|||
|
RGBA8888 |
|||
|
BGRA8888 |
|
Version |
Output Format |
Output Width and Height |
Output Width Stride, Height Stride, and Buffer Size Atlas 200/300/500 Inference Product Atlas Training Series Product |
|---|---|---|---|
|
General format, supported by all versions |
YUV420SP NV12 8-bit |
Width: Must be a multiple of 2. Height: Must be a multiple of 2. |
Width stride: Round up the width to the nearest multiple of 16. Height stride: Round up the height to the nearest multiple of 2. Buffer size (in bytes) = Width stride x Height stride x 3/2 |
|
YUV420SP NV21 8-bit |
Restrictions on Image Cropping and Pasting
|
Product Version |
Restrictions on Image Cropping and Pasting |
|---|---|
|
Atlas 200/300/500 Inference Product |
|
|
Atlas Training Series Product |
|
Restrictions on Downsampling of YUV Images
VPC converts the input image format to YUV444 or RGB for internal processing based on the input or output image format. YUV444 or RGB output format has no restriction on the odd and even numbers of the width and height, while the YUV420SP or YUV422SP output format, which performs downsampling processing, has such restrictions due to the data format. Therefore, abnormal data may exist on the output image boarders.
The root cause is that during the calculation, when the output image is at an odd start point, the image is correct for the YUV444 format. However, when the YUV420SP format is downsampled, the odd and even lines share the same UV. In this case, the Y of the start line and the UV of the previous line form a new pixel and abnormal data is generated.