Skip to content

[Model] Add support for nvidia/Llama-3_3-Nemotron-Super-49B-v1#5073

Closed
kylehh wants to merge 15 commits intosgl-project:mainfrom
kylehh:khuang-nemotron
Closed

[Model] Add support for nvidia/Llama-3_3-Nemotron-Super-49B-v1#5073
kylehh wants to merge 15 commits intosgl-project:mainfrom
kylehh:khuang-nemotron

Conversation

@kylehh
Copy link
Copy Markdown
Contributor

@kylehh kylehh commented Apr 4, 2025

Motivation

Add support for nvidia/Llama-3_3-Nemotron-Super-49B-v1

Modifications

Migration nemotron_nas from vllm

Checklist

@merrymercy merrymercy added the ready-to-merge The PR is ready to merge after the CI is green. label Apr 21, 2025
def __call__(self, layer_id: int, prefix: str) -> torch.nn.Module: ...


class PPMissingLayer(torch.nn.Identity):
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Do not add these first. Can you drop all of them?

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Missinglayer class was migrated from vllm implementation, which was created by Nemotron team. Keep it for code consistent and future proof. Do these cause the failure of CI pipeline?

@zhyncs
Copy link
Copy Markdown
Collaborator

zhyncs commented Jul 5, 2025

@kylehh please rebase. thanks.

@kylehh kylehh requested a review from zhaochenyang20 as a code owner July 10, 2025 02:36
@netanel-haber
Copy link
Copy Markdown
Contributor

Hello @merrymercy @zhyncs!
I'm going to attempt integrating this PR into main again, I'll bring it up to speed.
Thanks!

@kylehh kylehh deleted the khuang-nemotron branch December 15, 2025 17:19
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ready-to-merge The PR is ready to merge after the CI is green.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

6 participants