Safely clean up recent upstreamed code#3988

Merged
maanug-nv merged 8 commits into
NVIDIA-NeMo:mainfrom
maanug-nv:import-upstreamed
May 29, 2026

@maanug-nv

Contributor

What does this PR do?

Imports and re-exports the utils, functions, and classes that were recently upstreamed to MLM, and deletes the local copies.
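
A minimal sketch of the import, delete, and re-export pattern described above, not the actual diff. The module names `upstream_pkg` and `local_utils` and the function `example_helper` are hypothetical stand-ins, and the modules are built in memory only so the snippet runs on its own.

```python
import sys
import types

# Hypothetical stand-in for the upstream library; names are illustrative only.
upstream = types.ModuleType("upstream_pkg")


def example_helper(x: int) -> int:
    """Pretend this helper now lives upstream."""
    return x + 1


upstream.example_helper = example_helper
sys.modules["upstream_pkg"] = upstream

# Hypothetical local module: its own definition is deleted and replaced by an
# import plus a re-export, so existing import paths keep working.
LOCAL_SOURCE = '''
from upstream_pkg import example_helper

__all__ = ["example_helper"]
'''
local = types.ModuleType("local_utils")
exec(LOCAL_SOURCE, local.__dict__)
sys.modules["local_utils"] = local

# Callers that still import from the old location get the upstream object.
from local_utils import example_helper as via_local

assert via_local is upstream.example_helper
```

Keeping the old import path as a thin re-export means downstream callers do not need to change their imports when the local definition is removed.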

Changelog

  • Add specific line by line info of high level changes in this PR.

GitHub Actions CI

See the CI section in the Contributing doc for how to trigger the CI. An NVIDIA developer will need to approve and trigger the CI for external contributors.

Before your PR is "Ready for review"

Pre checks:

  • Make sure you read and followed Contributor guidelines
  • Did you write any new necessary tests?
  • Did you add or update any necessary documentation?
  • Does the PR affect components that are optional to install? (e.g., Numba, Pynini, Apex)
    • Reviewer: Does the PR have correct import guards for all optional libraries?

If you haven't finished some of the above items, you can still open a "Draft" PR.

Additional Information

  • Related to # (issue)

maanug-nv added 5 commits May 26, 2026 18:57
Signed-off-by: Maanu Grover <maanug@nvidia.com>
Signed-off-by: Maanu Grover <maanug@nvidia.com>
Signed-off-by: Maanu Grover <maanug@nvidia.com>
Signed-off-by: Maanu Grover <maanug@nvidia.com>
Signed-off-by: Maanu Grover <maanug@nvidia.com>
Signed-off-by: Maanu Grover <maanug@nvidia.com>
"""Restore ModelOpt quantization/sparsity state."""

# === HuggingFace Metadata ===
hf_model_id: str | None = None

Contributor Author

Note: these HuggingFace-specific fields are no longer available and are replaced by extra_checkpoint_metadata.
When migrating the conversion bridges to use configs based on this class (GPTModelConfig, HybridModelConfig, etc., replacing the providers), hf_model_id should be read from and written to the extra_checkpoint_metadata dict. We should confirm that this dict gets serialized to and deserialized from the checkpoint so that it doesn't break users, e.g. export+deploy.
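
A rough sketch of what the note above could look like in practice. Everything here is an assumption for illustration: `ExampleModelConfig` is a hypothetical stand-in (not the real GPTModelConfig), the key name `"hf_model_id"` and the two helpers are made up, and a JSON round trip stands in for whatever the checkpoint serialization actually does.

```python
import json
from dataclasses import asdict, dataclass, field
from typing import Any, Optional

HF_MODEL_ID_KEY = "hf_model_id"  # hypothetical key inside the metadata dict


@dataclass
class ExampleModelConfig:
    """Hypothetical stand-in for a model config; not the real class."""

    num_layers: int = 2
    extra_checkpoint_metadata: dict[str, Any] = field(default_factory=dict)


def set_hf_model_id(config: ExampleModelConfig, model_id: str) -> None:
    """Write the HuggingFace model id into the metadata dict."""
    config.extra_checkpoint_metadata[HF_MODEL_ID_KEY] = model_id


def get_hf_model_id(config: ExampleModelConfig) -> Optional[str]:
    """Read the HuggingFace model id, or None if it was never set."""
    return config.extra_checkpoint_metadata.get(HF_MODEL_ID_KEY)


config = ExampleModelConfig()
set_hf_model_id(config, "example-org/example-model")

# JSON round trip as a stand-in for saving and loading a checkpoint.
restored = ExampleModelConfig(**json.loads(json.dumps(asdict(config))))

assert get_hf_model_id(restored) == "example-org/example-model"
assert get_hf_model_id(ExampleModelConfig()) is None
```

A round-trip check of this shape, run against the real checkpoint save and load path, would be one way to confirm the metadata survives for export+deploy users.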

cc @yaoyu-33

yaoyu-33 added the area:misc, feature, and needs-review labels May 27, 2026
yaoyu-33 previously approved these changes May 27, 2026
Signed-off-by: Maanu Grover <maanug@nvidia.com>
@maanug-nv maanug-nv merged commit ee51bae into NVIDIA-NeMo:main May 29, 2026
99 checks passed
erictang000 added a commit to NovaSky-AI/SkyRL that referenced this pull request Jun 9, 2026
…ron-bridge to latest (#1762)

This PR makes the following bumps:

megatron_core from `cefc2520158c7ceba3f9adbe4b547a6f7a118da1` (latest
dev branch as of 6/8/26) to `71e418ea7d7b3a6c9a53238c543c3e0b43e11026`
(latest main branch as of 6/8/26).

megatron-bridge from `8382dc343b07b068a827ca20bae860633df3baa0` to
`91a15142a4b4442a8d46ab539d1b923bd08570d0` (latest main 6/8)

Megatron-Bridge has upstreamed code to the main branch that is not on the
dev branch and that is needed to use Megatron-Bridge
(NVIDIA-NeMo/Megatron-Bridge#3988).

Since sequence packing with GDN is now supported on main, we can move
back over to the latest commit on the main branch:
NVIDIA/Megatron-LM#2645
vasunvidia pushed a commit to vasunvidia/Megatron-Bridge that referenced this pull request Jun 10, 2026
Signed-off-by: Maanu Grover <maanug@nvidia.com>
Signed-off-by: Vasudevan Rengasamy <vrengasamy@nvidia.com>
Labels

  • area:misc: Cross-cutting utilities, logging, helpers, and other changes
  • feature: New capabilities, enhancements, or enablement work
  • needs-review: PR is ready for code review and waiting on a reviewer


3 participants