Open-Source Inference Engine
MindIE Turbo is a hardware adaptation plugin designed for efficient vLLM inference, enabling seamless interworking between NPUs and the vLLM framework. It accelerates LLM inference on NPUs, achieving higher throughput and lower latency.
MindIE Inference Engine
MindIE is a high-performance AI inference engine, enabling accelerated execution, debugging, tuning, and rapid migration. With its layered open architecture and unified interfaces, MindIE simplifies development while delivering peak performance through deeply optimized capabilities.
Customer-Developed Inference Engine
Customers can interconnect their inference engines with CANN through open APIs and acceleration libraries. This flexible architecture ensures high performance and stable deployment.
Development Resources
Installation Resources
Get Open-Source Inference Engine Resources
Use the Dockerfile to build an image and prepare the base environment required for models, including CANN, FrameworkPTAdapter, MindIE Turbo, and vLLM, for quick model inference. For details, refer to Set up using Docker.
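The layers such a Dockerfile assembles can be sketched roughly as follows. This is an illustrative outline only, not the project's actual Dockerfile: the base image, package names, and versions below are all assumptions (refer to Set up using Docker for the real file).

```dockerfile
# Illustrative sketch -- every image name, package name, and version here is an
# assumption; the actual Dockerfile ships with the open-source project.
FROM ubuntu:22.04

# 1. CANN: the NPU compute architecture layer. The real Dockerfile installs the
#    CANN toolkit package (or starts from a base image that already contains it).

# 2. Python stack: FrameworkPTAdapter (published on PyPI as torch-npu), vLLM,
#    and the MindIE Turbo plugin ("mindie-turbo" is an assumed package name).
RUN pip install torch-npu vllm mindie-turbo

CMD ["/bin/bash"]
```

Building this image and starting a container from it yields an environment where vLLM can run model inference on NPUs without further manual setup.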
Get MindIE Inference Engine Image
This image is pre-configured with the base environment required for model execution, including CANN, FrameworkPTAdapter, MindIE, and ATB Models, enabling rapid inference setup.
Models
View LLMs Supported by MindIE
LLMs and their versions supported by MindIE.
Supported:
DeepSeek
Qwen
LLaMA
ChatGLM
Baichuan
...



