[Canonical LoRA] feat: Support more flexible target_modules by HollowMan6 · Pull Request #1799 · NVIDIA-NeMo/Megatron-Bridge

HollowMan6 · 2025-12-23T17:35:59Z

What does this PR do ?

e.g., for MLA (DeepSeek V3 arch models), we should also be able to support all linear layers LoRA via setting target_modules as: ["linear_kv_down_proj","linear_kv_up_proj","linear_q_down_proj","linear_q_up_proj","linear_q_proj","linear_proj","linear_fc1_up","linear_fc1_gate","linear_fc2"]

Changelog

Not hard coding the possible supported target_modules and using endswith to matching target module names instead.

GitHub Actions CI

See the CI sectionin the Contributing doc for how to trigger the CI. A Nvidia developer will need to approve and trigger the CI for external contributors.

Before your PR is "Ready for review"

Pre checks:

Make sure you read and followed Contributor guidelines
Did you write any new necessary tests?
Did you add or update any necessary documentation?
Does the PR affect components that are optional to install? (Ex: Numba, Pynini, Apex etc)
- Reviewer: Does the PR have correct import guards for all optional libraries?

If you haven't finished some of the above items you can still open "Draft" PR.

Additional Information

Related to # (issue)

_{✨ Presented to you with Mind Lab - A Lab for Experiential Intelligence.}

e.g., for MLA (DeepSeek V3 arch models), we should also be able to support all linear layers LoRA via setting `target_modules` as: ["linear_kv_down_proj","linear_kv_up_proj","linear_q_down_proj","linear_q_up_proj","linear_q_proj","linear_proj","linear_fc1_up","linear_fc1_gate","linear_fc2"] Signed-off-by: Hollow Man <hollowman@opensuse.org>

copy-pr-bot · 2025-12-23T17:36:02Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

yaoyu-33 · 2025-12-24T03:44:24Z

/ok to test 5543bee

github-actions Bot added the community-request label Dec 23, 2025

HollowMan6 mentioned this pull request Dec 23, 2025

feat: Validate PEFT target modules #1747

Merged

yaoyu-33 approved these changes Dec 24, 2025

View reviewed changes

copy-pr-bot Bot temporarily deployed to nemo-ci December 24, 2025 03:44 Inactive

copy-pr-bot Bot temporarily deployed to test December 24, 2025 03:45 Inactive

copy-pr-bot Bot temporarily deployed to nemo-ci December 24, 2025 19:40 Inactive

copy-pr-bot Bot temporarily deployed to nemo-ci December 24, 2025 19:49 Inactive

copy-pr-bot Bot temporarily deployed to nemo-ci December 24, 2025 19:57 Inactive

yaoyu-33 merged commit 1c5fa2c into NVIDIA-NeMo:main Dec 26, 2025
49 checks passed

HollowMan6 deleted the canonical_mla branch December 26, 2025 09:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Canonical LoRA] feat: Support more flexible target_modules#1799

[Canonical LoRA] feat: Support more flexible target_modules#1799
yaoyu-33 merged 1 commit into
NVIDIA-NeMo:mainfrom
HollowMan6:canonical_mla

HollowMan6 commented Dec 23, 2025 •

edited

Loading

Uh oh!

copy-pr-bot Bot commented Dec 23, 2025

Uh oh!

yaoyu-33 commented Dec 24, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

HollowMan6 commented Dec 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do ?

Changelog

GitHub Actions CI

Before your PR is "Ready for review"

Additional Information

Uh oh!

copy-pr-bot Bot commented Dec 23, 2025

Uh oh!

yaoyu-33 commented Dec 24, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

HollowMan6 commented Dec 23, 2025 •

edited

Loading