fix: Propagate torch_dtype to sub-configs correctly by athitten · Pull Request #2027 · NVIDIA-NeMo/Automodel

athitten · 2026-04-23T19:32:29Z

What does this PR do ?

Fixes #2017.
Verified by running examples/vlm_finetune/gemma4/gemma4_4b_mock.yaml with torch_dtype: torch.float32; the error mentioned in the issue no longer appears.
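To illustrate the kind of failure described in #2017 (a top-level dtype that never reaches nested sub-configs), here is a minimal sketch. It is not the diff from this PR: `propagate_dtype` is a hypothetical helper, `SimpleNamespace` objects stand in for real model configs, and plain strings stand in for `torch.dtype` values so the example runs without PyTorch.

```python
from types import SimpleNamespace


def propagate_dtype(config, dtype, _seen=None):
    """Set `torch_dtype` on `config` and on every nested config object it holds.

    Hypothetical helper for illustration only; not part of Automodel or transformers.
    """
    _seen = set() if _seen is None else _seen
    if id(config) in _seen:  # guard against cyclic references between configs
        return config
    _seen.add(id(config))

    config.torch_dtype = dtype
    for value in list(vars(config).values()):
        # Assumption: a sub-config is any attribute object that itself carries `torch_dtype`.
        if hasattr(value, "__dict__") and hasattr(value, "torch_dtype"):
            propagate_dtype(value, dtype, _seen)
    return config


# Stand-in for a VLM config whose text and vision sub-configs default to bf16.
cfg = SimpleNamespace(
    torch_dtype="bfloat16",
    text_config=SimpleNamespace(torch_dtype="bfloat16", hidden_size=8),
    vision_config=SimpleNamespace(torch_dtype="bfloat16", patch_size=4),
)
propagate_dtype(cfg, "float32")
```

Setting only `cfg.torch_dtype` would leave `text_config` and `vision_config` at their old value, which mirrors the mixed-dtype state the issue reports as breaking FSDP2. Walking the nested objects keeps all levels consistent.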

Changelog

Add specific line by line info of high level changes in this PR.

Before your PR is "Ready for review"

Pre checks:

Make sure you read and followed Contributor guidelines
Did you write any new necessary tests?
Did you add or update any necessary documentation?

If you haven't finished some of the above items you can still open "Draft" PR.

Additional Information

Related to # (issue)

Signed-off-by: Abhishree Thittenamane <athittenaman@cw-dfw-cs-001-login-02.cm.cluster>

copy-pr-bot · 2026-04-23T19:32:33Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

athitten · 2026-04-23T19:34:41Z

/claude review

athitten · 2026-04-23T19:34:47Z

/ok to test 6de0c36

claude

LGTM

Propagate torch_dtype to sub-configs correctly

Signed-off-by: Abhishree Thittenamane <athittenaman@cw-dfw-cs-001-login-02.cm.cluster>
Co-authored-by: Abhishree Thittenamane <athittenaman@cw-dfw-cs-001-login-02.cm.cluster>

…o#2027)

Repin the Automodel submodule from 26108096 to 6eb5e862 ("fix: Propagate torch_dtype to sub-configs correctly", from NVIDIA-NeMo/Automodel#2027) as a temporary pin.

Note: 6eb5e862 is an unmerged PR commit (an older force-pushed revision of NVIDIA-NeMo#2027, not its current head and not on main) and predates the Nemotron-Omni RADIO post-load patches in 26108096. It still pins transformers==5.5.0 in its own metadata, so the transformers override stays consistent. The refreshed uv.lock reflects the reverse-delta (drops the later s3 / msc extras and the wandb>=0.26.1 pin).

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
Signed-off-by: root <zhangyuekai@foxmail.com>

Bump the Automodel submodule to 5dcc9abe9 ("fix: Propagate torch_dtype to sub-configs correctly", NVIDIA-NeMo/Automodel#2027).

This is the oldest commit on Automodel main that carries the NVIDIA-NeMo#2027 torch_dtype-propagation fix, so it is reachable by a plain `git submodule update` (unlike the orphaned, force-pushed PR-head revision of the same change, which lives in Automodel's pre-rewrite history and is on no upstream branch). It pins transformers==5.5.0 in its own metadata, keeping the transformers override consistent. uv.lock refreshed accordingly.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
Signed-off-by: root <zhangyuekai@foxmail.com>

Propagate torch_dtype to sub-configs correctly

6de0c36

Signed-off-by: Abhishree Thittenamane <athittenaman@cw-dfw-cs-001-login-02.cm.cluster>

athitten requested review from HuiyingLi, ZhiyuLi-Nvidia, adil-a, akoumpa, hemildesai, pthombre and zyzhou5 as code owners April 23, 2026 19:32

athitten changed the title ~~Propagate torch_dtype to sub-configs correctly~~ Fix: Propagate torch_dtype to sub-configs correctly Apr 23, 2026

athitten linked an issue Apr 23, 2026 that may be closed by this pull request

[Bug] Gemma4 VLM from_pretrained(torch_dtype=float32) leaves nested submodules in bf16 and breaks FSDP2 #2017

Closed

athitten mentioned this pull request Apr 23, 2026

[Bug] Gemma4 VLM from_pretrained(torch_dtype=float32) leaves nested submodules in bf16 and breaks FSDP2 #2017

Closed

athitten changed the title ~~Fix: Propagate torch_dtype to sub-configs correctly~~ fix: Propagate torch_dtype to sub-configs correctly Apr 23, 2026

copy-pr-bot Bot temporarily deployed to nemo-ci April 23, 2026 19:35 Inactive

copy-pr-bot Bot temporarily deployed to test April 23, 2026 19:35 Inactive

claude Bot approved these changes Apr 23, 2026

HuiyingLi approved these changes Apr 23, 2026

athitten merged commit 5dcc9ab into main Apr 23, 2026
61 of 65 checks passed

athitten deleted the athitten/gemma4_dtype_fix branch April 23, 2026 22:54

athitten merged 1 commit into main from athitten/gemma4_dtype_fix

2 participants