Memory (Memory Read/Write Bandwidth Rate)

The memory read/write bandwidth rate data collected from the UB/L1/L2/main memory is saved in Memory.csv. For details, see the field description in the following table.

Figure 1 Memory.csv file
See the following table for more details.
Table 1 Field description

Field

Description

block_id

Number of running task blocks, which corresponds to the number of cores configured during task running.

sub_block_id

Name and sequence number of each block used for task running.

aic_time(us)

Execution time (μs) on each AI Cube Core compute unit after the task is allocated to each AI Cube Core compute unit.

aic_total_cycles

Total number of cycles executed on each AI Cube Core compute unit after the task is allocated to each AI Cube Core compute unit.

aiv_time(us)

Execution time of each AI Vector Core compute unit after the task is allocated to the unit, in μs.

aiv_total_cycles

Total number of cycles executed on each AI Vector Core compute unit after the task is allocated to the unit.

aiv_ub_to_gm_bw(GB/s)

UB-to-GM write bandwidth (GB/s).

aiv_gm_to_ub_bw(GB/s)

GM-to-UB write bandwidth (GB/s).

aic_l1_read_bw(GB/s)

L1 read bandwidth (GB/s)

aic_l1_write_bw(GB/s)

L1 write bandwidth rate (GB/s).

ai*_main_mem_read_bw(GB/s)

Main memory read bandwidth (GB/s).

ai*_main_mem_write_bw(GB/s)

Main memory write bandwidth (GB/s).

aic_mte1_instructions

Number of MTE1 instructions.

aic_mte1_ratio

Ratio of cycles taken to execute MTE1 instructions to the total cycles.

ai*_mte2_instructions

Number of MTE2 instructions.

ai*_mte2_ratio

Ratio of cycles taken to execute MTE2 instructions to the total cycles.

ai*_mte3_instructions

Number of MTE3 instructions.

ai*_mte3_ratio

Ratio of cycles taken to execute MTE3 instructions to the total cycles.

read_main_memory_datas(KB)

Total amount of data read from the main memory.

write_main_memory_datas(KB)

Total amount of data written to the main memory.

GM_to_L1_datas(KB)

Amount of data transferred from GM to L1.

L1_to_GM_datas(KB)(estimate)

Estimated amount of data transferred from L1 to GM.

L0C_to_L1_datas(KB)

Amount of data transferred from L0C to L1.

L0C_to_GM_datas(KB)

Amount of data transferred from L0C to GM.

GM_to_UB_datas(KB)

Amount of data transferred from GM to UB.

UB_to_GM_datas(KB)

Amount of data transferred from UB to GM.

GM_to_L1_bw_usage_rate(%)

Bandwidth usage of the channel from GM to L1.

L1_to_GM_bw_usage_rate(%)(estimate)

Estimated bandwidth usage of the channel from L1 to GM.

L0C_to_L1_bw_usage_rate(%)

Bandwidth usage of the channel from L0C to L1.

L0C_to_GM_bw_usage_rate(%)

Bandwidth usage of the channel from L0C to GM.

GM_to_UB_bw_usage_rate(%)

Bandwidth usage of the channel from GM to UB.

UB_to_GM_bw_usage_rate(%)

Bandwidth usage of the channel from UB to GM.

Figure 2 Memory.csv file
See the following table for more details.
Table 2 Field description

Field

Description

aic_time(us)

Execution time of each AI Core compute unit after the task is allocated to the unit, in μs.

aic_total_cycles

Total number of cycles executed on each AI Core compute unit after the task is allocated to the unit.

aic_ub_to_gm_bw(GB/s)

UB-to-GM write bandwidth (GB/s).

aic_gm_to_ub_bw(GB/s)

GM-to-UB write bandwidth (GB/s).

aic_l1_read_bw(GB/s)

L1 read bandwidth (GB/s)

aic_l1_write_bw(GB/s)

L1 write bandwidth rate (GB/s).

aic_main_mem_read_bw(GB/s)

Main memory read bandwidth (GB/s).

aic_main_mem_write_bw(GB/s)

Main memory write bandwidth (GB/s).

aic_mte1_instructions

Number of MTE1 instructions.

aic_mte1_ratio

Ratio of cycles taken to execute MTE1 instructions to the total cycles.

aic_mte2_instructions

Number of MTE2 instructions.

aic_mte2_ratio

Ratio of cycles taken to execute MTE2 instructions to the total cycles.

aic_mte3_instructions

Number of MTE3 instructions.

aic_mte3_ratio

Ratio of cycles taken to execute MTE3 instructions to the total cycles.