[data] fix: Support Energon 7 metadata fields #4089
Signed-off-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
/ok to test 6f742fc
test
Light Review
Clean compatibility layer: centralizing the Energon 6/7 metadata logic into metadata.py is the right call.
Findings
Overall the change looks correct and well-scoped.
Suggested test cases
No perf tests impacted.
Co-authored-by: claude[bot] <209825114+claude[bot]@users.noreply.github.com>
Signed-off-by: Yu Yao <54727607+yaoyu-33@users.noreply.github.com>
/ok to test c0215a2
Summary
Original Bump `uv.lock` (main, mcore-main) (2026-05-30) #4087
main 52d1d681d6000e5da867b9b100c14648e4f85de9 updates the MCore Energon requirement from `~=6.0` to `~=7.0`, and the bump lockfile moves `megatron-energon` from `6.0.1` to `7.3.2`. Energon 7 removes the singular sample `__subflavor__` field and requires batch `__key__`/`__restore_key__` metadata, which broke Bridge unit tests in PR chore(beep boop 🤖): Bump `uv.lock` (main, mcore-main) (2026-05-30) #4087.
`src/megatron/bridge/data/energon/metadata.py` with TODO removal condition: remove when Bridge no longer supports `megatron-energon` 6.x. Removed stale guards: none.
Validation
`uv run pre-commit run --all-files` - passed.
`uv run python -m pytest tests/unit_tests/data/energon/test_hf_encoder_task_encoder.py tests/unit_tests/data/energon/test_nemotron_omni_task_encoder.py tests/unit_tests/recipes/qwen_vl/data/energon/test_task_encoder.py tests/unit_tests/diffusion/data/common/test_diffusion_sample.py tests/unit_tests/diffusion/data/common/test_diffusion_task_encoder.py tests/unit_tests/diffusion/data/flux/test_flux_taskencoder.py tests/unit_tests/diffusion/data/wan/test_wan_taskencoder.py tests/unit_tests/models/nemotron_omni/test_nemotron_omni_conversion.py -q` - passed, 59 passed, 32 warnings in 1.24s.
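For readers unfamiliar with the Energon 6/7 difference described in the Summary, the idea of a version-gated metadata layer can be sketched roughly as follows. This is not the content of `metadata.py`; it is a minimal illustration using plain dictionaries. The function names (`energon_major`, `sample_metadata`, `batch_metadata`), the tuple used to collect restore keys, and the assumption that Energon 6 batches need no extra fields are all hypothetical. Only the field names `__key__`, `__restore_key__`, and `__subflavor__` come from the PR description.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Any, Dict, List, Optional


def energon_major(dist_name: str = "megatron-energon") -> Optional[int]:
    """Return the installed major version, or None if the package is absent.

    Hypothetical helper: assumes a plain "X.Y.Z" version string.
    """
    try:
        return int(version(dist_name).split(".")[0])
    except PackageNotFoundError:
        return None


def sample_metadata(key: str, restore_key: Any, major: int) -> Dict[str, Any]:
    """Build sample-level metadata for the given Energon major version."""
    meta: Dict[str, Any] = {"__key__": key, "__restore_key__": restore_key}
    if major < 7:
        # Per the PR summary, Energon 7 removes the singular field,
        # so it is only populated for 6.x.
        meta["__subflavor__"] = None
    return meta


def batch_metadata(samples: List[Dict[str, Any]], major: int) -> Dict[str, Any]:
    """Build batch-level metadata from per-sample metadata dicts."""
    if major < 7:
        # Assumption: 6.x batches need no extra metadata fields.
        return {}
    return {
        "__key__": [s["__key__"] for s in samples],
        # Assumption: restore keys are simply collected in order.
        "__restore_key__": tuple(s["__restore_key__"] for s in samples),
    }


if __name__ == "__main__":
    # Fall back to 7 when the package is not installed (illustrative only).
    major = energon_major() or 7
    samples = [sample_metadata(f"sample-{i}", ("ds", i), major) for i in range(2)]
    print(batch_metadata(samples, major))
```

Keeping the version check in one helper means the TODO removal condition stated above (dropping `megatron-energon` 6.x support) would touch a single module rather than every task encoder.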