[Model] Add support for nvidia/Llama-3_3-Nemotron-Super-49B-v1 by kylehh · Pull Request #5073 · sgl-project/sglang

kylehh · 2025-04-04T22:11:47Z

Motivation

Add support for nvidia/Llama-3_3-Nemotron-Super-49B-v1

Modifications

Migration nemotron_nas from vllm

Checklist

Format your code according to the Code Formatting with Pre-Commit.
Add unit tests as outlined in the Running Unit Tests.
Update documentation / docstrings / example tutorials as needed, according to Writing Documentation.
Provide throughput / latency benchmark results and accuracy evaluation results as needed, according to Benchmark and Profiling and Accuracy Results.
For reviewers: If you haven't made any contributions to this PR and are only assisting with merging the main branch, please remove yourself as a co-author when merging the PR.
Please feel free to join our Slack channel at https://slack.sglang.ai to discuss your PR.

merrymercy · 2025-04-27T22:28:37Z

    def __call__(self, layer_id: int, prefix: str) -> torch.nn.Module: ...


+class PPMissingLayer(torch.nn.Identity):


Do not add these first. Can you drop all of them?

Missinglayer class was migrated from vllm implementation, which was created by Nemotron team. Keep it for code consistent and future proof. Do these cause the failure of CI pipeline?

zhyncs · 2025-07-05T06:26:09Z

@kylehh please rebase. thanks.

netanel-haber · 2025-08-10T13:40:17Z

Hello @merrymercy @zhyncs!
I'm going to attempt integrating this PR into main again, I'll bring it up to speed.
Thanks!

root and others added 9 commits April 2, 2025 23:09

init commit

c7fa255

import nemotron_nas

948b093

update sampler and compute_logits

3843751

logit_process update

2b6fec9

add test

f4dd95d

config update

44b1607

remove test py

2e927a4

code reformat

52754ad

formatting

6b30430

kylehh requested review from ByronHsu, Ying1123, hnyls2002, ispobock, merrymercy and zhyncs as code owners April 4, 2025 22:11

kylehh mentioned this pull request Apr 5, 2025

[Bug] Testing new Llama-3_3-Nemotron-Super-49B-v1 by Nvidia: "Model architectures ['DeciLMForCausalLM'] are not supported for now." #4689

Closed

5 tasks

kylehh and others added 3 commits April 5, 2025 02:12

format

4952a25

Merge branch 'main' into khuang-nemotron

e1cde31

update to support Nvidia Nemotron Ultra

1e63870

merrymercy added the ready-to-merge The PR is ready to merge after the CI is green. label Apr 21, 2025

Merge branch 'main' into khuang-nemotron

7461fb2

merrymercy requested changes Apr 27, 2025

View reviewed changes

zhyncs assigned yizhang2077 Jul 5, 2025

Merge branch 'main' into khuang-nemotron

2aacd08

kylehh requested a review from zhaochenyang20 as a code owner July 10, 2025 02:36

Merge branch 'main' into khuang-nemotron

056758e

netanel-haber mentioned this pull request Aug 11, 2025

model: support nvidia/Llama-3_3-Nemotron-Super-49B-v1 #9067

Merged

5 tasks

kylehh closed this Aug 11, 2025

kylehh deleted the khuang-nemotron branch December 15, 2025 17:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Model] Add support for nvidia/Llama-3_3-Nemotron-Super-49B-v1#5073

[Model] Add support for nvidia/Llama-3_3-Nemotron-Super-49B-v1#5073
kylehh wants to merge 15 commits intosgl-project:mainfrom
kylehh:khuang-nemotron

kylehh commented Apr 4, 2025

Uh oh!

merrymercy Apr 27, 2025

Uh oh!

kylehh Apr 28, 2025

Uh oh!

zhyncs commented Jul 5, 2025

Uh oh!

netanel-haber commented Aug 10, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

		def __call__(self, layer_id: int, prefix: str) -> torch.nn.Module: ...


		class PPMissingLayer(torch.nn.Identity):

Conversation

kylehh commented Apr 4, 2025

Motivation

Modifications

Checklist

Uh oh!

merrymercy Apr 27, 2025

Choose a reason for hiding this comment

Uh oh!

kylehh Apr 28, 2025

Choose a reason for hiding this comment

Uh oh!

zhyncs commented Jul 5, 2025

Uh oh!

netanel-haber commented Aug 10, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants