
[NPU] Adapt cann 8.5: use sfa and lightning indexer op from cann and CI update#17615

Merged
iforgetmyname merged 12 commits into sgl-project:main from monkeyLoveding:wq_main
Jan 27, 2026

Conversation

@monkeyLoveding
Contributor

monkeyLoveding commented Jan 23, 2026

Motivation

The sparse flash attention (sfa) and lightning indexer ops in the custom_ops package have some bugs. The fixed ops are integrated in CANN 8.5, so we update these ops and call them through torch_npu.

lightning_indexer: https://gitcode.com/cann/ops-transformer/blob/master/attention/lightning_indexer/README.md
sfa: https://gitcode.com/cann/ops-transformer/blob/master/attention/sparse_flash_attention/README.md

This PR also upgrades the CI to the corresponding CANN 8.5 version.
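
For illustration only, the version dependency in this motivation (the fixed ops ship with CANN 8.5, and we assume earlier toolkits lack them) can be sketched as a small version gate. The helper names `parse_cann_version` and `has_fixed_attention_ops` are hypothetical and are not part of SGLang, torch_npu, or CANN. The 8.5 threshold is an assumption taken from this description, not from any library API.

```python
import re

# Assumption: CANN 8.5 is the first release carrying the fixed operators,
# as stated in this PR description. This is not read from any library.
MIN_FIXED_CANN = (8, 5)


def parse_cann_version(text):
    """Extract (major, minor) from strings such as '8.5.0' or '8.3.RC1'."""
    match = re.search(r"(\d+)\.(\d+)", text)
    if match is None:
        raise ValueError(f"no version number found in {text!r}")
    return int(match.group(1)), int(match.group(2))


def has_fixed_attention_ops(cann_version):
    """Return True when the toolkit version is at least MIN_FIXED_CANN."""
    return parse_cann_version(cann_version) >= MIN_FIXED_CANN
```

A caller could use `has_fixed_attention_ops("8.5.0")` to decide whether to rely on the ops provided by CANN; how the installed version string is obtained is left open here.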

Modifications

Accuracy Tests

image


github-actions bot added the npu label Jan 23, 2026
@ping1jing2
Collaborator

What's the difference from #17489?

@monkeyLoveding
Contributor Author

monkeyLoveding commented Jan 23, 2026

> What's the difference from #17489?

#17489 was merged into CI Code Adapting to cann8.5

@ping1jing2
Collaborator

> > What's the difference from #17489?
>
> #17189 was merged into CI Code Adapting to cann8.5

Sorry, I don't understand what you mean.

Comment thread .github/workflows/nightly-test-npu.yml Outdated
STREAMS_PER_DEVICE: 32
run: |
export PATH="/usr/local/Ascend/8.3.RC1/compiler/bishengir/bin:${PATH}"
export PATH="/usr/local/Ascend/cann-8.5.0/aarch64-linux/bin:${PATH}"
Collaborator


Why do we need this change?
If bishengir is not needed, just remove this line.

@iforgetmyname
Collaborator

/tag-and-rerun-ci

@iforgetmyname
Collaborator

/rerun-failed-ci

iforgetmyname merged commit d578b41 into sgl-project:main Jan 27, 2026
251 of 308 checks passed
charlesHsuGG pushed a commit to charlesHsuGG/sglang that referenced this pull request Jan 28, 2026
ping1jing2 self-assigned this Jan 28, 2026
Chen-0210 pushed a commit to Chen-0210/sglang that referenced this pull request Jan 30, 2026
Johnsonms pushed a commit to Johnsonms/sglang that referenced this pull request Feb 14, 2026
4 participants