feat: Support Nemotron-nano-v3 Omni AutoModel Path by yuekaizhang · Pull Request #2362 · NVIDIA-NeMo/RL

yuekaizhang · 2026-04-29T10:25:44Z

This PR follows the nano-v3-omni mbridge training branch to add AutoModel backend support for Nemotron-Nano-Omni.

Signed-off-by: Yuekai Zhang <yuekaiz@cw-dfw-cs-001-vscode-02.cm.cluster> Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com>

Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com>

Add .claude/settings.local.json, .codex, and .humanize/ to .gitignore as these are local tool configuration/cache files that should not be tracked. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com>

- Add MMPRTinyDataset class with HF download fallback and local cache support
- Add format_mmpr_tiny_dataset for OpenAI-API message conversion
- Port verl_geo3k reward function from old Megatron-Bridge implementation
- Register mmpr-tiny in DATASET_REGISTRY and vlm_hf_data_processor
- Register verl_geo3k reward in VLMVerifyWorker
- Add mathruler dependency to pyproject.toml for answer grading
- Create debug-friendly YAML config with step_400 checkpoint
- Create launch script with uv sync pre-step
- Create CoT prompt file with \boxed{} instruction

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com>

- Remove self.preprocessor from MMPRTinyDataset to prevent double formatting (vlm_hf_data_processor already handles format dispatch)
- Add pylatexenc to pyproject.toml as transitive dependency of mathruler (mathruler does not declare it in its own metadata)

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com>

- Fix prompt file brace escaping: \boxed{} -> \boxed{{}} so str.format() has exactly one replacement placeholder (fixes AC-2/AC-5)
- Add explicit split parameter validation to MMPRTinyDataset with ValueError for unsupported splits (fixes AC-1 negative test)
- Regenerate uv.lock with mathruler and pylatexenc entries (fixes AC-6)
- Add unit tests for dataset formatting, split validation, prompt file format compatibility, and verl_geo3k_reward (14 tests, all pass)

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com>
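The brace-escaping fix can be illustrated with a minimal sketch. The template text and `render_prompt` are hypothetical stand-ins (the actual prompt file is not shown on this page), and a single positional `{}` placeholder for the question is assumed.

```python
# Hypothetical template: one positional placeholder for the question, plus a
# literal \boxed{} that must survive str.format(). Not the PR's actual file.
PROMPT_TEMPLATE = "{}\nPut the final answer in \\boxed{{}}."

# Same text without escaping: str.format() sees two replacement fields.
UNESCAPED_TEMPLATE = "{}\nPut the final answer in \\boxed{}."


def render_prompt(question: str) -> str:
    """Fill the single placeholder; '{{}}' collapses to a literal '{}'."""
    return PROMPT_TEMPLATE.format(question)


rendered = render_prompt("What is 2 + 2?")

try:
    UNESCAPED_TEMPLATE.format("What is 2 + 2?")
    unescaped_error = None
except IndexError as exc:
    # Only one argument was supplied for two '{}' fields.
    unescaped_error = exc
```

With the escaped template, the rendered prompt ends in a literal `\boxed{}`. The unescaped variant fails at format time because the second `{}` has no matching argument.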

Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com>

Add 3 processor smoke tests using a stub NemotronNanoVLV2Processor:

- test_processor_produces_valid_datum_spec: verifies DatumSpec fields
- test_prompted_text_contains_boxed_literal: verifies \boxed{} survives
- test_placeholder_conversion_for_nemotron_processor: verifies <image> placeholder and question text in vllm_content

Uses a tiny 1x1 PNG fixture for image resolution. All 17 MMPR tests pass (11 dataset + 6 reward).

Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com>

- Stub now captures the exact text arg passed to __call__
- Assert exact equality: vllm_content == "<image>\n" + prompted_question
- Assert exactly one <image> token in output (no duplicates)
- Negative assertion: raw dataset string "<image>\nQuestion" not in output
- Assert captured __call__ text matches expected tokenizer input
- All tests run with -p no:testmon --override-ini='addopts='

Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com>

Clean up partial state (stale images_dir, parquet, temp dir) before re-downloading when the ready marker is absent. Prevents shutil.move from nesting images/images when images_dir already exists from an interrupted prior attempt. Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com>
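The cleanup-before-retry behaviour described above might look roughly like the following sketch. All names here (`READY_MARKER`, `STALE_ENTRIES`, `cache_is_ready`, `install_images`) and the file layout are assumptions for illustration; they are not taken from the PR's code.

```python
import shutil
import tempfile
from pathlib import Path

# All names below are hypothetical; the PR's actual file layout is not shown.
READY_MARKER = ".ready"
STALE_ENTRIES = ("images", "train.parquet", "tmp_download")


def cache_is_ready(cache_dir: Path) -> bool:
    """Return True if a previous download finished; otherwise wipe leftovers."""
    cache_dir.mkdir(parents=True, exist_ok=True)
    if (cache_dir / READY_MARKER).exists():
        return True
    for name in STALE_ENTRIES:
        path = cache_dir / name
        if path.is_dir():
            shutil.rmtree(path)
        elif path.exists():
            path.unlink()
    return False


def install_images(cache_dir: Path, extracted: Path) -> None:
    """Move extracted images into place, then write the ready marker last."""
    # If cache_dir/images already existed, shutil.move would place `extracted`
    # inside it (images/images); cache_is_ready() removes it beforehand.
    shutil.move(str(extracted), str(cache_dir / "images"))
    (cache_dir / READY_MARKER).touch()


# Demo: simulate an interrupted attempt, then a clean retry.
with tempfile.TemporaryDirectory() as tmp:
    cache = Path(tmp) / "cache"
    (cache / "images").mkdir(parents=True)  # stale state, no ready marker
    ready_before = cache_is_ready(cache)    # wipes the stale directory
    extracted = Path(tmp) / "images"
    extracted.mkdir()
    (extracted / "0.png").write_bytes(b"")
    install_images(cache, extracted)
    nested = (cache / "images" / "images").exists()
    flat = (cache / "images" / "0.png").exists()
    ready_after = cache_is_ready(cache)
```

Writing the marker only after the move completes means an interrupted run leaves no marker behind, so the next attempt starts from a clean directory.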

MMPR-Tiny has 4,192 rows with multiple images (up to 11). The previous code truncated to images[0], losing visual context for multi-image questions.

- _load_mmpr_tiny_from_cache: keep all image paths instead of [imgs[0]]
- format_mmpr_tiny_dataset: split question on <image> and <image_N> placeholders, interleave image content items with text segments
- Add tests for multi-image and numbered-placeholder formatting

Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com>
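As a rough illustration of the placeholder splitting and interleaving, here is a minimal sketch. The function name `interleave_images` and the content-item keys are hypothetical. It also assumes that placeholders consume images in order of appearance (not by the number in `<image_N>`) and that any images without a placeholder are appended at the end. The PR's `format_mmpr_tiny_dataset` may handle both cases differently.

```python
import re

# Matches both the bare <image> token and numbered forms such as <image_2>.
IMAGE_TOKEN = re.compile(r"<image(?:_\d+)?>")


def interleave_images(question: str, image_paths: list[str]) -> list[dict]:
    """Build a content list that alternates image items and text segments."""
    content: list[dict] = []
    images = iter(image_paths)
    # The non-capturing group keeps the tokens out of the split result,
    # so every boundary between segments marks one placeholder.
    segments = IMAGE_TOKEN.split(question)
    for index, segment in enumerate(segments):
        if index > 0:
            path = next(images, None)
            if path is not None:
                content.append({"type": "image", "image": path})
        text = segment.strip()
        if text:
            content.append({"type": "text", "text": text})
    # Assumption: images with no placeholder are appended after the text.
    for path in images:
        content.append({"type": "image", "image": path})
    return content


content = interleave_images(
    "<image>\nCompare with <image_2> please", ["a.png", "b.png"]
)
```

Here `content` holds four items in order: the first image, the text "Compare with", the second image, and the text "please".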

Signed-off-by: root <zhangyuekai@foxmail.com> Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com>

Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com>

…nting Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com>

…sound_projection to resume training Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com>

Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com>

github-actions · 2026-05-28T05:22:35Z

✅ Submodule Fast-Forward Check Results

Check based on commit: 52a8808 (PR #2362 from nemotron)

✅ Submodules that are properly updated:

Automodel: ✅ PR branch is ahead of main branch (fast-forward)

All submodule changes look good! ✨

Resolved conflicts:

- pyproject.toml: kept transformers==5.5.0 (gemma support) over origin/main's 5.3.0, force-overriding past vLLM 0.20.0's !=5.5.0 constraint; took origin/main's mlflow>=3.12.0.
- tests/unit/test_recipes_and_test_suites.py: kept nightly GPU-hours ceiling at 1410 (merged suite dry-runs to 1409 GPU hours; 1360 would fail).
- uv.lock: regenerated from merged pyproject.toml (transformers 5.3.0 -> 5.5.0).

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

yuekaizhang · 2026-06-01T04:08:16Z

/ok to test dfaeb37

github-actions · 2026-06-01T04:08:19Z

✅ Submodule Fast-Forward Check Results

Check based on commit: dfaeb37 (PR #2362 from nemotron)

✅ Submodules that are properly updated:

Automodel: ✅ PR branch is ahead of main branch (fast-forward)

All submodule changes look good! ✨

…lues Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> Signed-off-by: root <zhangyuekai@foxmail.com>

github-actions · 2026-06-01T06:39:58Z

✅ Submodule Fast-Forward Check Results

Check based on commit: ef24ee2 (PR #2362 from nemotron)

✅ Submodules that are properly updated:

Automodel: ✅ PR branch is ahead of main branch (fast-forward)

All submodule changes look good! ✨

yuekaizhang · 2026-06-01T06:40:00Z

/ok to test ef24ee2

This reverts commit 5de2521. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> Signed-off-by: root <zhangyuekai@foxmail.com>

yuekaizhang · 2026-06-02T05:09:16Z

/ok to test 96c3ffe

github-actions · 2026-06-02T05:09:31Z

✅ Submodule Fast-Forward Check Results

Check based on commit: 0fa4b2d (PR #2362 from nemotron)

✅ Submodules that are properly updated:

Automodel: ✅ PR branch is ahead of main branch (fast-forward)

All submodule changes look good! ✨

github-actions · 2026-06-02T05:09:47Z

✅ Submodule Fast-Forward Check Results

Check based on commit: 96c3ffe (PR #2362 from nemotron)

✅ Submodules that are properly updated:

Automodel: ✅ PR branch is ahead of main branch (fast-forward)

All submodule changes look good! ✨

yuekaizhang and others added 28 commits April 29, 2026 02:55

bump to vllm 0.19 latest

9d4a9cc

Signed-off-by: Yuekai Zhang <yuekaiz@cw-dfw-cs-001-vscode-02.cm.cluster> Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com>

close optimizer save

9c97187

Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com>

update config multi node

5cdd202

Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com>

add 3rd party to ignore

2571a86

Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com>

chore: add testmondata to .gitignore

6512967

Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com>

filter multi image examples

6e6890b

Signed-off-by: root <zhangyuekai@foxmail.com> Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com>

update ignore

08a1a44

Signed-off-by: root <zhangyuekai@foxmail.com> Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com>

replaced mathruler

fd107b8

Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com>

add 32 node config

690d898

Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com>

vllm gpu mem 0.2, illegal instruction

2a7f8a5

Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com>

enable activation checkpointing

cc802e1

Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com>

automodel image_flag support, dynamic resolution, activation checkpoi…

ee75c34

…nting Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com>

Post-hoc fused_attn=True on RADIO ViT blocks, Freeze sound_encoder / …

a89262a

…sound_projection to resume training Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com>

change 4node yaml to small batch

c1b22bd

Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com>

update 1 node debug yamml

1d9705c

Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com>

update vllm 0.20.0

a81b5a6

Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com>

update config

9e08086

Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com>

remove useless scripts

eeb93f7

Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com>

chore: apply pre-commit auto-fixes (ruff, ruff-format, taplo)

d6adc41

Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com>

revert: restore .gitignore to upstream version

23c4a0e

Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com>

yuekaizhang requested review from a team as code owners April 29, 2026 10:25

copy-pr-bot Bot temporarily deployed to nemo-ci May 28, 2026 05:22 Inactive

copy-pr-bot Bot temporarily deployed to public May 28, 2026 05:22 Inactive

copy-pr-bot Bot temporarily deployed to public May 28, 2026 05:27 Inactive

copy-pr-bot Bot temporarily deployed to nemo-ci May 28, 2026 06:04 Inactive

copy-pr-bot Bot had a problem deploying to nemo-ci May 28, 2026 07:45 Failure

yuekaizhang dismissed jinglinglingling’s stale review via dfaeb37 June 1, 2026 04:07

copy-pr-bot Bot temporarily deployed to public June 1, 2026 04:08 Inactive

copy-pr-bot Bot temporarily deployed to public June 1, 2026 04:09 Inactive

copy-pr-bot Bot temporarily deployed to test June 1, 2026 04:11 Inactive

copy-pr-bot Bot temporarily deployed to public June 1, 2026 04:12 Inactive

test: assert relative reward ordering instead of brittle hardcoded va…

ef24ee2

…lues Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> Signed-off-by: root <zhangyuekai@foxmail.com>

copy-pr-bot Bot temporarily deployed to public June 1, 2026 06:40 Inactive

copy-pr-bot Bot temporarily deployed to test June 1, 2026 06:43 Inactive

copy-pr-bot Bot temporarily deployed to public June 1, 2026 06:44 Inactive

yuekaizhang and others added 2 commits June 1, 2026 22:06

Revert "fix(vllm): force sleep level=1 for Nano-Nemotron-VL/Omni models"

0fa4b2d

This reverts commit 5de2521. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> Signed-off-by: root <zhangyuekai@foxmail.com>

Merge branch 'main' into nemotron

96c3ffe

copy-pr-bot Bot temporarily deployed to public June 2, 2026 05:09 Inactive

yuekaizhang wants to merge 64 commits into NVIDIA-NeMo:main from yuekaizhang:nemotron


4 participants
