[model] feat: Add limited Gemma4 dense model support by pavelgein · Pull Request #3885 · NVIDIA-NeMo/Megatron-Bridge

pavelgein · 2026-05-19T12:29:16Z

What does this PR do ?

The current implementation of the Gemma4VL bridge supports only MoE models.
This restriction is loosened to support the Gemma4 31B model, since it does not have per-layer embeddings.
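
As a rough illustration of the relaxed restriction, the check could look like the sketch below. This is not the code from this PR: `HypotheticalGemma4Config`, `is_moe`, and `is_supported_by_bridge` are made-up names, and only the `hidden_size_per_layer_input` attribute name comes from the review discussion. The assumed rule is that MoE models are accepted as before, and a dense model is accepted only when it has no per-layer embeddings.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class HypotheticalGemma4Config:
    """Stand-in for a model config; not the real Hugging Face or Megatron-Bridge class."""

    is_moe: bool  # hypothetical flag: True for MoE variants
    # None or 0 is taken to mean "no per-layer embeddings" (assumption).
    hidden_size_per_layer_input: Optional[int] = None


def is_supported_by_bridge(config: HypotheticalGemma4Config) -> bool:
    """Accept MoE models, plus dense models without per-layer embeddings (assumed rule)."""
    if config.is_moe:
        return True
    return not config.hidden_size_per_layer_input


# Three illustrative configurations with invented values.
moe = HypotheticalGemma4Config(is_moe=True)
dense_without_ple = HypotheticalGemma4Config(is_moe=False)
dense_with_ple = HypotheticalGemma4Config(is_moe=False, hidden_size_per_layer_input=256)

for name, cfg in [
    ("moe", moe),
    ("dense_without_ple", dense_without_ple),
    ("dense_with_ple", dense_with_ple),
]:
    print(name, is_supported_by_bridge(cfg))
```

Under these assumptions, only the dense configuration with a non-zero `hidden_size_per_layer_input` is rejected.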

copy-pr-bot · 2026-05-19T12:29:20Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

yaoyu-33 · 2026-05-20T22:23:04Z

/claude review

claude · 2026-05-20T22:27:19Z

Light Review. Issues found:

(1) Typo, gemma4_provider.py:349: "decided base on" should be "decided based on".
(2) Missing space in error message, gemma4_vl_bridge.py:83: the f-string period has no trailing space, so the text runs together.
(3) Confusing error wording, gemma4_vl_bridge.py:81: "or model without hidden_size_per_layer_input" is hard to parse.
(4) Copy-paste docstring, test_gemma4_vl_conversion.py:300: says "MoE bridge", should say "Dense bridge".

See inline comments for code suggestions. Suggested test cases: No perf tests impacted.
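
Issue (2) is a common pitfall with adjacent string literals: Python joins them with no separator, so a sentence that ends one literal runs straight into the next unless a trailing space is written explicitly. The message text below is invented for illustration and is not the actual error string from gemma4_vl_bridge.py.

```python
model_name = "example-model"  # hypothetical value, for illustration only

# Adjacent literals are joined with no separator, so the sentences run together.
broken = (
    f"Unsupported model {model_name}."
    "Only some variants are supported."
)

# A trailing space after the period keeps the sentences apart.
fixed = (
    f"Unsupported model {model_name}. "
    "Only some variants are supported."
)

print(broken)
print(fixed)
```

A linter rule or a unit test that asserts on the full message text is one way to catch this kind of slip before review.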

kamran-nvidia · 2026-05-20T22:58:24Z

@pavelgein Please address Claude's comments. Thanks

kamran-nvidia · 2026-05-21T12:11:48Z

/ok to test f6f5420

kamran-nvidia · 2026-05-21T15:14:35Z

@pavelgein Please address the CI failures. I will run the CI again once addressed.

kamran-nvidia · 2026-05-21T18:48:42Z

/ok to test 8ea504c

Signed-off-by: Pavel Gein <pavel.gein@gmail.com>

pavelgein · 2026-05-22T06:23:51Z

I have added support for the dense Gemma4 model (not the VLM one), so could you please review once again?

kamran-nvidia · 2026-05-22T16:20:31Z

/ok to test 2305b42

Signed-off-by: Pavel Gein <pavel.gein@gmail.com> Co-authored-by: Kamran Jafari <kjafarisadeg@nvidia.com> Signed-off-by: Vasudevan Rengasamy <vrengasamy@nvidia.com>

github-actions Bot added the community-request label May 19, 2026

yaoyu-33 added the labels area:model (Model implementations and HF bridge logic), feature (New capabilities, enhancements, or enablement work), and waiting-on-maintainers (Waiting on maintainers to respond) May 19, 2026

kamran-nvidia requested a review from yaoyu-33 May 20, 2026 18:23

yaoyu-33 previously approved these changes May 20, 2026

yaoyu-33 added the ready-to-merge label (PR is approved, current, and only waiting for CI to pass before merge) and removed the waiting-on-maintainers label (Waiting on maintainers to respond) May 20, 2026

claude Bot reviewed May 20, 2026

Comment thread src/megatron/bridge/models/gemma/gemma4_provider.py Outdated

claude Bot reviewed May 20, 2026

Comment thread src/megatron/bridge/models/gemma_vl/gemma4_vl_bridge.py

claude Bot reviewed May 20, 2026

Comment thread tests/functional_tests/test_groups/models/gemma_vl/test_gemma4_vl_conversion.py Outdated

pavelgein dismissed yaoyu-33’s stale review via f6f5420 May 21, 2026 02:57

copy-pr-bot Bot temporarily deployed to public May 21, 2026 12:12 Inactive

copy-pr-bot Bot temporarily deployed to test May 21, 2026 12:12 Inactive

copy-pr-bot Bot temporarily deployed to public May 21, 2026 13:34 Inactive

copy-pr-bot Bot temporarily deployed to public May 21, 2026 13:49 Inactive

kamran-nvidia added the waiting-on-customer label (Waiting on the original author to respond) and removed the ready-to-merge label (PR is approved, current, and only waiting for CI to pass before merge) May 21, 2026

copy-pr-bot Bot temporarily deployed to public May 21, 2026 18:49 Inactive

copy-pr-bot Bot temporarily deployed to test May 21, 2026 18:49 Inactive

copy-pr-bot Bot temporarily deployed to public May 21, 2026 19:20 Inactive

copy-pr-bot Bot temporarily deployed to public May 21, 2026 19:38 Inactive

kamran-nvidia added the ready-to-merge label (PR is approved, current, and only waiting for CI to pass before merge) and removed the waiting-on-customer label (Waiting on the original author to respond) May 21, 2026

kamran-nvidia previously approved these changes May 21, 2026

pavelgein added 5 commits May 22, 2026 11:22

[model] feat: Add limited Gemma4 dense model support

6a5f38a

Signed-off-by: Pavel Gein <pavel.gein@gmail.com>

fix kv tying for dense models

9cc765e

Signed-off-by: Pavel Gein <pavel.gein@gmail.com>

fix docstrings

1dc0cbd

Signed-off-by: Pavel Gein <pavel.gein@gmail.com>

tests

99740fe

Signed-off-by: Pavel Gein <pavel.gein@gmail.com>

[model,tests] feat: support Gemma4 dense model in bridge

49d9bba

Signed-off-by: Pavel Gein <pavel.gein@gmail.com>

pavelgein dismissed kamran-nvidia’s stale review via 49d9bba May 22, 2026 06:22

pavelgein force-pushed the gemma4_31b branch from 8ea504c to 49d9bba Compare May 22, 2026 06:22

kamran-nvidia requested a review from yaoyu-33 May 22, 2026 12:01

kamran-nvidia added the needs-review label (PR is ready for code review and waiting on a reviewer) and removed the ready-to-merge label (PR is approved, current, and only waiting for CI to pass before merge) May 22, 2026

yaoyu-33 added the ready-to-merge label (PR is approved, current, and only waiting for CI to pass before merge) and removed the needs-review label (PR is ready for code review and waiting on a reviewer) May 22, 2026

yaoyu-33 approved these changes May 22, 2026

Merge branch 'main' into gemma4_31b

2305b42

copy-pr-bot Bot temporarily deployed to public May 22, 2026 16:21 Inactive

copy-pr-bot Bot temporarily deployed to test May 22, 2026 16:21 Inactive

copy-pr-bot Bot temporarily deployed to public May 23, 2026 06:56 Inactive

copy-pr-bot Bot temporarily deployed to public May 23, 2026 07:11 Inactive

kamran-nvidia merged commit 9e19852 into NVIDIA-NeMo:main May 23, 2026
73 checks passed

cuichenx mentioned this pull request May 26, 2026

[NeMo FW 26.06 Release] MBridge v0.5.0 Roadmap #3754

Open

Zhichenzzz mentioned this pull request May 28, 2026

[gemma4] feat: add Gemma-4 31B dense model support radixark/Megatron-Bridge#8

Open

kamran-nvidia merged 6 commits into NVIDIA-NeMo:main from pavelgein:gemma4_31b

4 participants
