chore(beep boop 🤖): Bump `uv.lock` (main) (2026-02-17) by ko3n1g · Pull Request #2399 · NVIDIA-NeMo/Megatron-Bridge

ko3n1g · 2026-02-17T01:04:45Z

🚀 PR to bump uv.lock in main.

📝 Please remember the following to-do's before merge:

Verify the presubmit CI

🙏 Please merge this PR only if the CI workflow completed successfully.

Summary by CodeRabbit

Chores
- Updated Megatron-LM submodule dependency to the latest version.

Signed-off-by: Oliver Koenig <okoenig@nvidia.com>

coderabbitai · 2026-02-17T01:08:58Z

📝 Walkthrough

Walkthrough

Updates the Megatron-LM git submodule pointer from commit 76a9f472a58fa054120031a8533404cd799b3f01 to commit 8f1c2f8ae53b4e3f32c0ae7f397d8b38a675eaa2. This is a metadata-only change with no behavioral modifications.

Changes

Cohort / File(s)	Summary
Submodule Reference Update `3rdparty/Megatron-LM`	Updated git submodule pointer to a newer commit in the Megatron-LM dependency.

Estimated code review effort

🎯 1 (Trivial) | ⏱️ ~2 minutes

Possibly related PRs

chore(beep boop 🤖): Bump uv.lock (main) (2026-02-13) #2377: Sequential submodule pointer update that sets the reference to the prior commit (76a9f47), which this PR advances from.
chore(beep boop 🤖): Bump uv.lock (r0.3.0) (2026-02-12) #2348: Updates the same Megatron-LM submodule pointer to a different commit.
chore(beep boop 🤖): Bump uv.lock (main) (2026-02-12) #2349: Updates the Megatron-LM submodule pointer reference similarly.

Suggested reviewers

yaoyu-33
maanug-nv

🚥 Pre-merge checks | ✅ 3 | ❌ 2

❌ Failed checks (2 warnings)

Check name	Status	Explanation	Resolution
Title check	⚠️ Warning	The title mentions 'Bump uv.lock' but the actual changes are to Megatron-LM submodule pointer, not uv.lock file updates.	Update the title to accurately reflect the main change: updating the Megatron-LM submodule pointer, e.g., 'chore: Update Megatron-LM submodule pointer'.
Test Results For Major Changes	⚠️ Warning	PR contains major breaking changes in Megatron-LM submodule but PR description lacks test results and compatibility verification.	Update PR description to include test results demonstrating Megatron-Bridge compatibility with breaking changes and regression testing.

✅ Passed checks (3 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Docstring Coverage	✅ Passed	No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check.
Merge Conflict Detection	✅ Passed	✅ No merge conflicts detected when merging into `main`

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment
Commit unit tests in branch bump-ci-container-2026-02-17-main

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 2

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@3rdparty/Megatron-LM`:
- Line 1: The PR title/description claims a bump to uv.lock but the actual
change updates the Megatron-LM git submodule pointer; update the PR to
accurately reflect the change or adjust the commit contents: either (a) change
the PR title/body to state "update Megatron-LM submodule pointer" and include
the submodule diff referencing Megatron-LM, or (b) revert the unintended
submodule update and include the intended uv.lock changes instead; verify the
commit(s) and branch contain only the intended modifications (uv.lock or
Megatron-LM) before updating the PR.
- Line 1: This submodule update introduces breaking changes: RL artifacts now
use JSON (megatron/rl/rl_utils.py), RL training/inference switches to an
OpenAI-server style interface (deprecated flags: --langrl-inference-server-type
inplace_megatron and chat_mode), DSA indexer APIs have been refactored, and
mamba now exposes mamba_memory_ratio for KV-cache/memory partitioning; to fix,
search the codebase for imports/usages of megatron/rl/rl_utils.py and replace
pickle read/write logic with JSON serialization/deserialization, update any
command-line parsing or config handling that references
--langrl-inference-server-type or chat_mode to the new OpenAI-style inference
config (and adjust callers of the RL inference functions accordingly), audit and
update DSA-related imports/calls to the new indexer symbols, and add handling
for mamba_memory_ratio where Mamba inference is configured (expose the new
parameter in configs and ensure memory partitioning logic uses it); run RL
training/inference and DSA/mamba unit/integration tests to validate behavior
before merging.

coderabbitai · 2026-02-17T01:09:01Z

@@ -1 +1 @@
-Subproject commit 76a9f472a58fa054120031a8533404cd799b3f01
+Subproject commit 8f1c2f8ae53b4e3f32c0ae7f397d8b38a675eaa2


⚠️ Potential issue | 🟠 Major

PR description doesn't match actual changes.

The PR title and description indicate this is bumping uv.lock, but the actual change updates the Megatron-LM git submodule pointer. This inconsistency suggests either:

The wrong files were included in this PR

The PR description is incorrect and should reference the submodule update instead

Please verify that the intended changes are included in this PR.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@3rdparty/Megatron-LM` at line 1, The PR title/description claims a bump to uv.lock but the actual change updates the Megatron-LM git submodule pointer; update the PR to accurately reflect the change or adjust the commit contents: either (a) change the PR title/body to state "update Megatron-LM submodule pointer" and include the submodule diff referencing Megatron-LM, or (b) revert the unintended submodule update and include the intended uv.lock changes instead; verify the commit(s) and branch contain only the intended modifications (uv.lock or Megatron-LM) before updating the PR.

⚠️ Potential issue | 🔴 Critical

🧩 Analysis chain

🏁 Script executed:

# Check if the submodule exists and what its current state is git ls-files -s | grep 3rdparty/Megatron-LM

Repository: NVIDIA-NeMo/Megatron-Bridge

Length of output: 141

🏁 Script executed:

# Check the .gitmodules file to understand the submodule configuration cat .gitmodules | grep -A 3 "Megatron-LM" || echo "No matching entry found"

Repository: NVIDIA-NeMo/Megatron-Bridge

Length of output: 183

🏁 Script executed:

# Check if the submodule is initialized and get its current commit if [ -d "3rdparty/Megatron-LM/.git" ]; then cd 3rdparty/Megatron-LM git rev-parse HEAD cd ../.. else echo "Submodule not initialized or not a git directory" fi

Repository: NVIDIA-NeMo/Megatron-Bridge

Length of output: 119

🏁 Script executed:

# Fetch commit information from the remote Megatron-LM repository git ls-remote https://github.com/NVIDIA/Megatron-LM.git refs/heads/ | head -20

Repository: NVIDIA-NeMo/Megatron-Bridge

Length of output: 53

🏁 Script executed:

# Try to get commit log between the two commits using git # First, let's fetch the actual commits python3 << 'PYTHON_EOF' import subprocess import json old_commit = "76a9f472a58fa054120031a8533404cd799b3f01" new_commit = "8f1c2f8ae53b4e3f32c0ae7f397d8b38a675eaa2" # Use git log with a temporary fetch to get commit info # We'll use git for-each-ref or fetch the commits try: # Try to get the commits using git result = subprocess.run( ['git', 'log', '--format=%H %s', f'{old_commit}..{new_commit}'], capture_output=True, text=True, timeout=10 ) if result.returncode == 0 and result.stdout: print("Commits found locally:") print(result.stdout) else: print("Commits not found locally, will need to fetch from remote") print(f"Old commit: {old_commit}") print(f"New commit: {new_commit}") except Exception as e: print(f"Error: {e}") PYTHON_EOF

Repository: NVIDIA-NeMo/Megatron-Bridge

Length of output: 234

🏁 Script executed:

# Try initializing the submodule to check out the commits cd 3rdparty/Megatron-LM 2>/dev/null || { git submodule update --init 3rdparty/Megatron-LM cd 3rdparty/Megatron-LM } if [ -d ".git" ] || [ -f ".git" ]; then OLD_COMMIT="76a9f472a58fa054120031a8533404cd799b3f01" NEW_COMMIT="8f1c2f8ae53b4e3f32c0ae7f397d8b38a675eaa2" echo "=== Commits included in this submodule update ===" git log --oneline --no-merges "${OLD_COMMIT}..${NEW_COMMIT}" 2>/dev/null | head -20 echo "" echo "=== Summary of changes (first 30 files) ===" git diff --stat "${OLD_COMMIT}..${NEW_COMMIT}" 2>/dev/null | head -30 else echo "Submodule still not initialized" fi

Repository: NVIDIA-NeMo/Megatron-Bridge

Length of output: 102

🏁 Script executed:

# Fetch commit information from GitHub API curl -s "https://api.github.com/repos/NVIDIA/Megatron-LM/commits/76a9f472a58fa054120031a8533404cd799b3f01" | jq '.commit.message' 2>/dev/null || echo "Commit fetch failed"

Repository: NVIDIA-NeMo/Megatron-Bridge

Length of output: 163

🏁 Script executed:

# Try to get comparison between the two commits curl -s "https://api.github.com/repos/NVIDIA/Megatron-LM/compare/76a9f472a58fa054120031a8533404cd799b3f01...8f1c2f8ae53b4e3f32c0ae7f397d8b38a675eaa2" | python3 << 'PYTHON_EOF' import json import sys try: data = json.load(sys.stdin) if 'commits' in data: print(f"Total commits: {len(data['commits'])}") print("\n=== Commits ===") for commit in data['commits'][:20]: # Show first 20 print(f"{commit['sha'][:7]} {commit['commit']['message'].split(chr(10))[0]}") if 'files' in data: print(f"\n=== Files changed: {len(data['files'])} ===") for f in data['files'][:30]: # Show first 30 print(f"{f['status']}: {f['filename']} ({f.get('additions', 0)} +, {f.get('deletions', 0)} -)") except Exception as e: print(f"Error: {e}", file=sys.stderr) PYTHON_EOF

Repository: NVIDIA-NeMo/Megatron-Bridge

Length of output: 119

🏁 Script executed:

# Fetch comparison between the two commits with better error handling curl -s -H "Accept: application/vnd.github.v3+json" \ "https://api.github.com/repos/NVIDIA/Megatron-LM/compare/76a9f472a58fa054120031a8533404cd799b3f01...8f1c2f8ae53b4e3f32c0ae7f397d8b38a675eaa2" \ | python3 -c " import json import sys try: data = json.load(sys.stdin) if 'message' in data: print(f'Error: {data[\"message\"]}') elif 'commits' in data: print(f'Total commits: {len(data[\"commits\"])}') print() for commit in data['commits'][:30]: msg = commit['commit']['message'].split('\n')[0] print(f'{commit[\"sha\"][:7]} {msg}') else: print('No commits field in response') print(json.dumps(data, indent=2)[:500]) except json.JSONDecodeError as e: print(f'JSON Error: {e}') except Exception as e: print(f'Error: {e}') "

Repository: NVIDIA-NeMo/Megatron-Bridge

Length of output: 739

🌐 Web query:

Megatron-LM commits 76a9f472a58fa054120031a8533404cd799b3f01 8f1c2f8ae53b4e3f32c0ae7f397d8b38a675eaa2 changes breaking

💡 Result:

Between 76a9f47 and 8f1c2f8, the changes most likely to be breaking are:

RL training/inference interface switch to the “Inference OpenAI API server”: the RL example command line removes --langrl-inference-server-type inplace_megatron, and RL environment/agent configs drop “chat-mode” style knobs like chat_mode / assistant_suffix (and related prompt formatting paths), so existing RL configs/scripts using those fields/flags will fail or behave differently. [1]

RL artifacts format change (pickle → JSON): megatron/rl/rl_utils.py stops writing *.pkl files in binary via pickle.dump(...) and instead writes *.json (text) via json.dump([[r.model_dump() ...]]). Any downstream tooling expecting pickle files or Python-object roundtrips will break. [2]

Experimental DSA attention API refactor: “Decouple topk and loss from DSA Indexer” adds new helper functions (e.g., fused_qk_topk_naive, FusedDSAIndexerLoss) and removes/relocates prior logic (e.g., _compute_index_scores as previously structured), which can break any code importing/depending on those internal symbols/behaviors. [3]

Inference dynamic batching behavior/args (hybrid Mamba models): adds InferenceConfig.mamba_memory_ratio and a new CLI flag --inference-dynamic-batching-mamba-memory-ratio; enabling it changes how KV-cache vs Mamba-state memory is partitioned and can clamp max_requests, which can alter throughput/memory assumptions for existing deployments. [4]

Sources: [1] commit 7a8f305 patch, [2] commit 8f1c2f8 diff, [3] commit b890099 patch, [4] commit 4beb8ca patch.

Review and integrate breaking changes from this submodule update carefully.

This submodule update includes 11 commits with several breaking changes that require code updates and testing:

RL artifacts format migration: RL utilities now write JSON instead of pickle files (megatron/rl/rl_utils.py). Any code reading pickle artifacts or expecting Python object serialization will break.

RL training/inference interface change: The update switches to the OpenAI API server inference model. Existing RL configs and command-line arguments (e.g., --langrl-inference-server-type inplace_megatron, chat_mode) are no longer supported.

DSA attention API refactor: Internal DSA indexer symbols and behaviors have changed. Code importing from DSA modules may need updates.

Mamba inference dynamic batching: New mamba_memory_ratio parameter controls KV-cache vs Mamba-state memory partitioning, affecting throughput and memory allocation on Mamba-based deployments.

Verify that the Megatron-Bridge codebase is compatible with these changes before merging, particularly in RL training/inference paths and any code depending on Megatron-LM's internal APIs.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@3rdparty/Megatron-LM` at line 1, This submodule update introduces breaking changes: RL artifacts now use JSON (megatron/rl/rl_utils.py), RL training/inference switches to an OpenAI-server style interface (deprecated flags: --langrl-inference-server-type inplace_megatron and chat_mode), DSA indexer APIs have been refactored, and mamba now exposes mamba_memory_ratio for KV-cache/memory partitioning; to fix, search the codebase for imports/usages of megatron/rl/rl_utils.py and replace pickle read/write logic with JSON serialization/deserialization, update any command-line parsing or config handling that references --langrl-inference-server-type or chat_mode to the new OpenAI-style inference config (and adjust callers of the RL inference functions accordingly), audit and update DSA-related imports/calls to the new indexer symbols, and add handling for mamba_memory_ratio where Mamba inference is configured (expose the new parameter in configs and ensure memory partitioning logic uses it); run RL training/inference and DSA/mamba unit/integration tests to validate behavior before merging.

) Signed-off-by: Oliver Koenig <okoenig@nvidia.com> Co-authored-by: maanug-nv <109391026+maanug-nv@users.noreply.github.com> Signed-off-by: pengdurice <pengduhit@gmail.com>

Signed-off-by: Oliver Koenig <okoenig@nvidia.com> Co-authored-by: maanug-nv <109391026+maanug-nv@users.noreply.github.com>

chore(beep boop 🤖): Bump uv.lock (main) (2026-02-17)

77f5393

Signed-off-by: Oliver Koenig <okoenig@nvidia.com>

copy-pr-bot Bot temporarily deployed to nemo-ci February 17, 2026 01:05 Inactive

copy-pr-bot Bot temporarily deployed to test February 17, 2026 01:05 Inactive

coderabbitai Bot reviewed Feb 17, 2026

View reviewed changes

maanug-nv approved these changes Feb 17, 2026

View reviewed changes

copy-pr-bot Bot temporarily deployed to nemo-ci February 17, 2026 01:38 Inactive

copy-pr-bot Bot temporarily deployed to nemo-ci February 17, 2026 02:15 Inactive

copy-pr-bot Bot temporarily deployed to nemo-ci February 17, 2026 02:25 Inactive

This was referenced Feb 25, 2026

chore(beep boop 🤖): Bump uv.lock (r0.3.0) (2026-02-25) #2524

Merged

chore(beep boop 🤖): Bump uv.lock (main) (2026-02-26) #2554

Closed

chore(beep boop 🤖): Bump uv.lock (r0.3.0) (2026-02-26) #2555

Closed

copy-pr-bot Bot pushed a commit that referenced this pull request Mar 19, 2026

chore(beep boop 🤖): Bump uv.lock (main) (2026-02-17) (#2399)

7ddcf0a

Signed-off-by: Oliver Koenig <okoenig@nvidia.com> Co-authored-by: maanug-nv <109391026+maanug-nv@users.noreply.github.com>

This was referenced Mar 20, 2026

chore(beep boop 🤖): Bump uv.lock (main, mcore-dev) (2026-03-20) #2917

Closed

chore(beep boop 🤖): Bump uv.lock (main, mcore-main) (2026-03-26) #2996

Closed

coderabbitai Bot mentioned this pull request Mar 28, 2026

chore(beep boop 🤖): Bump uv.lock (main, mcore-dev) (2026-03-28) #3013

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

chore(beep boop 🤖): Bump `uv.lock` (main) (2026-02-17)#2399

chore(beep boop 🤖): Bump `uv.lock` (main) (2026-02-17)#2399
ko3n1g merged 1 commit into
mainfrom
bump-ci-container-2026-02-17-main

ko3n1g commented Feb 17, 2026 •

edited by coderabbitai Bot

Loading

Uh oh!

coderabbitai Bot commented Feb 17, 2026

Walkthrough

Changes

Estimated code review effort

Possibly related PRs

Suggested reviewers

❌ Failed checks (2 warnings)

Uh oh!

coderabbitai Bot left a comment

Uh oh!

coderabbitai Bot Feb 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		@@ -1 +1 @@
		Subproject commit 76a9f472a58fa054120031a8533404cd799b3f01
		Subproject commit 8f1c2f8ae53b4e3f32c0ae7f397d8b38a675eaa2

Conversation

ko3n1g commented Feb 17, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Uh oh!

coderabbitai Bot commented Feb 17, 2026

Walkthrough

Changes

Estimated code review effort

Possibly related PRs

Suggested reviewers

❌ Failed checks (2 warnings)

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Feb 17, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

ko3n1g commented Feb 17, 2026 •

edited by coderabbitai Bot

Loading