Roofline Bottleneck Analysis Chart

The Roofline bottleneck analysis chart builds a performance model of the processor. The model gives a quick estimate of the theoretical performance limit of an operator, which helps developers identify bottlenecks quickly.
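
As a rough illustration of the idea behind such a model, the classic Roofline formulation bounds attainable performance by the lower of two limits: peak compute throughput, and peak memory bandwidth multiplied by arithmetic intensity. The sketch below assumes that classic formulation. The function names and the hardware numbers are hypothetical and are not taken from MindStudio Insight, whose exact calculation is not described here.

```python
def attainable_performance(peak_compute, peak_bandwidth, arithmetic_intensity):
    """Classic Roofline bound (assumed formulation, not MindStudio-specific).

    peak_compute:         peak compute throughput, e.g. in FLOP/s
    peak_bandwidth:       peak memory bandwidth, e.g. in byte/s
    arithmetic_intensity: operations performed per byte moved, in FLOP/byte
    """
    return min(peak_compute, peak_bandwidth * arithmetic_intensity)


def ridge_point(peak_compute, peak_bandwidth):
    """Arithmetic intensity at which the memory roof meets the compute roof."""
    return peak_compute / peak_bandwidth


# Hypothetical hardware numbers, chosen only to make the arithmetic easy.
PEAK_COMPUTE = 100.0    # units of compute per second
PEAK_BANDWIDTH = 1.0    # units of data per second

# Below the ridge point the bandwidth roof limits the operator.
low_intensity_limit = attainable_performance(PEAK_COMPUTE, PEAK_BANDWIDTH, 20.0)

# Above the ridge point the compute roof limits the operator.
high_intensity_limit = attainable_performance(PEAK_COMPUTE, PEAK_BANDWIDTH, 200.0)

print(low_intensity_limit, high_intensity_limit, ridge_point(PEAK_COMPUTE, PEAK_BANDWIDTH))
```

An operator whose arithmetic intensity lies to the left of the ridge point sits under the memory roof, and one to the right sits under the compute roof.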

  • Before using MindStudio Insight, install the MindStudio Insight software package. For the download link, see Installation and Uninstallation.
  • For details about how to use MindStudio Insight, see Details.
  • For details about how to import the visualize_data.bin file into MindStudio Insight, see Importing Profile Data.

The visualize_data.bin file generated by msprof op can be imported into MindStudio Insight for display. The Roofline analysis chart varies with the hardware and the operator type.

  • For the , the Roofline bottleneck analysis chart contains only the memory unit view.
    Figure 1 Roofline bottleneck analysis chart for
  • For the , the view generated varies according to the operator type. For details, see Table 1.
    Figure 2 Roofline bottleneck analysis chart for
    Table 1 Roofline views supported by

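    Roofline View Type       Vector Operator   Cube Operator   Mix Operator
    GM/L2 view               Supported         Supported       Supported
    Vector memory unit       Supported         -               Supported
    Vector memory channel    Supported         -               Supported
    Vector Pipeline          Supported         -               Supported
    Cube memory unit         -                 Supported       Supported
    Cube memory channel      -                 Supported       Supported
    Cube Pipeline            -                 Supported       Supported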

The Roofline analysis chart evaluates the performance usage of each operator and reports one of the following results:
  • If the operator performance usage is greater than 80%, the message displayed depends on the region in which the operator falls.
    • Compute Bound: computing bottleneck.
    • Memory Bound: memory bottleneck.
  • If the operator performance usage is less than 80%, the bound type is latency bound, and the message displayed depends on the maximum pipeline ratio.
    • If the maximum pipeline ratio is less than 80%, the message "latency bound:pipeline caused" is displayed.
    • If the maximum pipeline ratio is greater than 80%, the message depends on the type of the pipeline that has the maximum ratio.
      • If that pipeline is a compute pipeline (cube ratio, vector ratio, or scalar ratio), the message "latency bound:compute caused" is displayed.
      • If that pipeline is a memory pipeline (mte1 ratio, mte2 ratio, or mte3 ratio), the message "latency bound:memory caused" is displayed.
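
The decision rules above can be sketched as a small function. This is only an illustration of the logic as described, not the tool's implementation. The function name, the argument layout, and the way the region is passed in are hypothetical. The rules do not say what happens at exactly 80%, so the sketch assumes that a usage of exactly 80% is treated as latency bound and that a maximum pipeline ratio of exactly 80% takes the "greater than" branch.

```python
# Pipeline groups named in the rules above.
COMPUTE_PIPELINES = {"cube", "vector", "scalar"}
MEMORY_PIPELINES = {"mte1", "mte2", "mte3"}
THRESHOLD = 0.80


def classify_bound(usage, region, pipeline_ratios):
    """Return the message selected by the rules (hypothetical helper).

    usage:           operator performance usage as a fraction in [0, 1]
    region:          "compute" or "memory", the Roofline region of the operator
    pipeline_ratios: dict mapping a pipeline name (e.g. "mte2") to its ratio
    """
    if usage > THRESHOLD:
        return "Compute Bound" if region == "compute" else "Memory Bound"

    # Usage at or below the threshold: latency bound (equality is an assumption).
    name, ratio = max(pipeline_ratios.items(), key=lambda item: item[1])
    if ratio < THRESHOLD:
        return "latency bound:pipeline caused"
    # Ratio at or above the threshold (equality is an assumption).
    if name in COMPUTE_PIPELINES:
        return "latency bound:compute caused"
    if name in MEMORY_PIPELINES:
        return "latency bound:memory caused"
    raise ValueError(f"unknown pipeline: {name}")


# Usage example with made-up ratios.
ratios = {"cube": 0.10, "vector": 0.35, "scalar": 0.20,
          "mte1": 0.05, "mte2": 0.90, "mte3": 0.15}
print(classify_bound(0.55, "memory", ratios))
```

In the usage example the operator's usage is below 80% and its largest pipeline ratio belongs to mte2, so the memory-caused branch is selected.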