Skip to content

[Roadmap] Ascend NPU Development (2026 Q1) #13664

@iforgetmyname

Description

@iforgetmyname

It's our honour to be one of the supported hardware backends of SGLang ever since July, 2025. Many works were done in the past quarter to support major features and common models. Also we have received a lot of feedbacks and demands that help us improve our using experience and stability, thank you all for your loud voice.

For the next quarter, our goal is continue to improve model performance and increase range of supported features. We are thrilled to announce our development roadmap for 2026 Q1 here:

CI

  • Switch to Ascend NPU daily docker release to reduce setup time
  • Split PR tests and daily tests to reduce PR test cost

Basic Software Version Bump

User / Developer Experience

Model Support & Model Performance

Major Features

Ascend NPU Hardware Backend

Parallelism

Expert Parallelism

Context Parallelism

NPUGraph and Torch Compile

Quantization

HiCache

Speculative Decoding

Reinforcement Learning

  • Basic support on Reinforcement Learning

More Frameworks

Metadata

Metadata

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions