[examples] Add dynamic context parallel example by ilml · Pull Request #3892 · NVIDIA-NeMo/Megatron-Bridge

ilml · 2026-05-19T20:16:18Z

Summary

Add a minimal long-context Dynamic CP packing demo; it does not train a model or launch distributed workers.
Use Megatron-Core dev DefaultDynamicCPScheduler to schedule toy variable-length samples, then print the packed THD metadata per DPxCP rank: tokens.shape, cu_seqlens, max_seqlen, and local_cp_size.
Add examples/training_features/long_context/README.md explaining how to run the demo, how to read the output, and how the printed metadata maps to the real DCP forward-step path.
Show the Bridge config knobs users set in a real run: dynamic_context_parallel=True, sequence_packing_scheduler="default_dynamic_cp", max_seqlen_per_dp_cp_rank, min_dynamic_context_parallel_size, and micro_batch_size=1.
Pass Dynamic CP initialization kwargs through Bridge distributed setup when the installed MCore supports them, so dev Dynamic CP groups are created correctly after the MCore bump.

Testing

python -m py_compile examples/training_features/long_context/dynamic_context_parallel.py
git diff --check
tmux window 1, Docker container 7366a763896a after ./scripts/switch_mcore.sh dev + uv sync: uv run python examples/training_features/long_context/dynamic_context_parallel.py (prints per-sample gpus_needed and scheduled packed microbatches with local_cp_size)
tmux window 1: uv run ruff check examples/training_features/long_context/dynamic_context_parallel.py src/megatron/bridge/training/initialize.py
tmux window 1: uv run ruff format --check examples/training_features/long_context/dynamic_context_parallel.py src/megatron/bridge/training/initialize.py
tmux window 1: uv run pre-commit run --all-files

copy-pr-bot · 2026-05-19T20:16:22Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

Signed-off-by: ilml <tolong@nvidia.com>

cuichenx · 2026-05-28T00:02:57Z

/ok to test 51bb3c5

Signed-off-by: ilml <tolong@nvidia.com> Signed-off-by: Vasudevan Rengasamy <vrengasamy@nvidia.com>

ilml marked this pull request as draft May 19, 2026 21:13

ilml force-pushed the codex/dynamic-cp-example branch 5 times, most recently from ec5a517 to 529ce57 Compare May 27, 2026 17:39

[examples] feat: add dynamic context parallel example

c2df21f

Signed-off-by: ilml <tolong@nvidia.com>

ilml force-pushed the codex/dynamic-cp-example branch from 529ce57 to c2df21f Compare May 27, 2026 21:34

ilml marked this pull request as ready for review May 27, 2026 22:33

Merge branch 'main' into codex/dynamic-cp-example

48b21f1

yaoyu-33 previously approved these changes May 27, 2026

View reviewed changes

yaoyu-33 added area:training Training loop, callbacks, and runtime integration community-request feature New capabilities, enhancements, or enablement work needs-review PR is ready for code review and waiting on a reviewer labels May 27, 2026

[examples] docs: add long-context README

51bb3c5

Signed-off-by: ilml <tolong@nvidia.com>

ilml dismissed yaoyu-33’s stale review via 51bb3c5 May 27, 2026 23:08

cuichenx approved these changes May 28, 2026

View reviewed changes

github-actions Bot removed the community-request label May 28, 2026

copy-pr-bot Bot temporarily deployed to public May 28, 2026 00:03 Inactive

copy-pr-bot Bot temporarily deployed to test May 28, 2026 00:03 Inactive

copy-pr-bot Bot temporarily deployed to public May 28, 2026 00:11 Inactive

copy-pr-bot Bot temporarily deployed to public May 28, 2026 00:31 Inactive

cuichenx merged commit d1067ce into NVIDIA-NeMo:main May 28, 2026
98 of 99 checks passed

cuichenx mentioned this pull request May 28, 2026

[NeMo FW 26.06 Release] MBridge v0.5.0 Roadmap #3754

Open

vasunvidia pushed a commit to vasunvidia/Megatron-Bridge that referenced this pull request Jun 10, 2026

[examples] Add dynamic context parallel example (NVIDIA-NeMo#3892)

448f345

Signed-off-by: ilml <tolong@nvidia.com> Signed-off-by: Vasudevan Rengasamy <vrengasamy@nvidia.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[examples] Add dynamic context parallel example#3892

[examples] Add dynamic context parallel example#3892
cuichenx merged 3 commits into
NVIDIA-NeMo:mainfrom
ilml:codex/dynamic-cp-example

ilml commented May 19, 2026 •

edited

Loading

Uh oh!

copy-pr-bot Bot commented May 19, 2026

Uh oh!

cuichenx commented May 28, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

ilml commented May 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Testing

Uh oh!

copy-pr-bot Bot commented May 19, 2026

Uh oh!

cuichenx commented May 28, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

ilml commented May 19, 2026 •

edited

Loading