Skip to content

Support cuda graph for DP attention#2061

Merged
merrymercy merged 4 commits intosgl-project:mainfrom
ispobock:dp-cuda-graph
Nov 18, 2024
Merged

Support cuda graph for DP attention#2061
merrymercy merged 4 commits intosgl-project:mainfrom
ispobock:dp-cuda-graph

Conversation

@ispobock
Copy link
Copy Markdown
Collaborator

Motivation

Support cuda graph for DP attention (#1970).

Comment thread python/sglang/srt/managers/scheduler.py
Comment thread python/sglang/srt/managers/scheduler.py Outdated
Comment thread python/sglang/srt/managers/scheduler.py Outdated
ispobock and others added 3 commits November 18, 2024 08:17
Co-authored-by: Lianmin Zheng <lianminzheng@gmail.com>
Co-authored-by: Lianmin Zheng <lianminzheng@gmail.com>
@merrymercy merrymercy merged commit 62832bb into sgl-project:main Nov 18, 2024
@merrymercy merrymercy mentioned this pull request Nov 20, 2024
timethink pushed a commit to timethink/sglang that referenced this pull request Mar 9, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants