Update training script and test configurations for MIMO LLaVA by kamran-nvidia · Pull Request #3293 · NVIDIA-NeMo/Megatron-Bridge

kamran-nvidia · 2026-04-13T00:52:27Z

Adjusted training iterations from 2000 to 1000 in run_hetero_llava.sh
Renamed experiment in wandb from "mimo-llava-e2e-test" to "mimo-llava-hetero-e2e-test"
Removed unused parallelism configurations in run_hetero_llava_parallelism_tests_unfrozen_llm.sh
Introduced CLIPViTNoCLS class in test_mimo_training_llava.py to drop CLS token
Updated encoder sequence length to 576 in test_mimo_training_llava.py
Modified argument parsing for freeze options to use custom boolean parser

What does this PR do ?

Add a one line overview of what this PR aims to accomplish.

Changelog

Add specific line by line info of high level changes in this PR.

GitHub Actions CI

See the CI sectionin the Contributing doc for how to trigger the CI. A Nvidia developer will need to approve and trigger the CI for external contributors.

Before your PR is "Ready for review"

Pre checks:

Make sure you read and followed Contributor guidelines
Did you write any new necessary tests?
Did you add or update any necessary documentation?
Does the PR affect components that are optional to install? (Ex: Numba, Pynini, Apex etc)
- Reviewer: Does the PR have correct import guards for all optional libraries?

If you haven't finished some of the above items you can still open "Draft" PR.

Additional Information

Related to # (issue)

- Adjusted training iterations from 2000 to 1000 in run_hetero_llava.sh - Renamed experiment in wandb from "mimo-llava-e2e-test" to "mimo-llava-hetero-e2e-test" - Removed unused parallelism configurations in run_hetero_llava_parallelism_tests_unfrozen_llm.sh - Introduced CLIPViTNoCLS class in test_mimo_training_llava.py to drop CLS token - Updated encoder sequence length to 576 in test_mimo_training_llava.py - Modified argument parsing for freeze options to use custom boolean parser Signed-off-by: Kamran Jafari <kjafarisadeg@nvidia.com>

copy-pr-bot · 2026-04-13T00:52:31Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

kamran-nvidia marked this pull request as ready for review April 13, 2026 14:14

kamran-nvidia requested a review from liding-nv April 13, 2026 14:15

liding-nv approved these changes Apr 13, 2026

View reviewed changes

liding-nv merged commit d1a37ee into mimo/phase5-checkpointing-rebuild Apr 13, 2026
2 checks passed

liding-nv deleted the kamran/mimo_llava_fixes branch April 13, 2026 14:52

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update training script and test configurations for MIMO LLaVA#3293

Update training script and test configurations for MIMO LLaVA#3293
liding-nv merged 1 commit into
mimo/phase5-checkpointing-rebuildfrom
kamran/mimo_llava_fixes

kamran-nvidia commented Apr 13, 2026

Uh oh!

copy-pr-bot Bot commented Apr 13, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

kamran-nvidia commented Apr 13, 2026

What does this PR do ?

Changelog

GitHub Actions CI

Before your PR is "Ready for review"

Additional Information

Uh oh!

copy-pr-bot Bot commented Apr 13, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants