Support next_n = 4 for SM100 MQA logits #230

Closed
RayWang96 wants to merge 2 commits into deepseek-ai:main from RayWang96:sm100_next_n_4

@RayWang96
Collaborator

Use a thread block cluster to process each query instead of a single SM.

@benchislett

Any updates on this?

@RayWang96
Collaborator Author

> Any updates on this?

We have no plans to merge this into the main branch.

@RayWang96 RayWang96 closed this Feb 24, 2026
LyricZhao added a commit that referenced this pull request Apr 16, 2026
* Use vllm swiglu

* Reset do_cpu_sync

* Minor fix

* Minor fix

* Fit fused kernel

* Minor fix

* Simplify

* Minor fix

* Simplify

* Simplify

* Lints

* Runnable without legacy code

* Simplify

---------

Co-authored-by: Chenggang Zhao <chenggangz@deepseek.com>