[docs] Model merging #1423
Conversation
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
Hi, is there an estimated timeline for when this will be merged?
cb44c70 to 9a84309
Hi, it should be ready in the next few days, and by the end of the week at the latest if there are no major issues!
9a84309 to bde2219
| adapters = ["norobots", "adcopy", "sql"] | ||
| weights = [2.0, 0.3, 0.7] | ||
| adapter_name = "merge" | ||
| density = 0.2 |
I am not sure how this density parameter works in dare_ties. I assume it is used to keep 20% of the parameters and then rescale, as in DARE. However, if TIES is applied on top of that, it is unclear whether it again keeps only 20% of the already pruned and rescaled checkpoint, leaving roughly 0.2 * 0.2 * 100 = 4% of the parameters, or whether 20% of the parameters are kept overall. This is not a comment on the documentation, but the behaviour is not very clear.
Hello, in dare_ties, random pruning based on density happens first, followed by rescaling. After that, the majority_sign_mask and disjoint_merge steps are performed as in the ties method. So the pruning is taken from dare (random, followed by rescaling), and the majority sign mask and disjoint merge come from ties; there is no second density-based pruning step.
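For intuition, here is a rough, illustrative sketch of that order of operations on a stack of task vectors. The function name, shapes, and the exact averaging in the disjoint merge are assumptions for illustration, not the PEFT implementation:

```python
import torch

def dare_ties_sketch(deltas, weights, density):
    # DARE step: randomly keep a `density` fraction of each task vector, then rescale by 1/density
    pruned = []
    for delta, weight in zip(deltas, weights):
        keep_mask = torch.bernoulli(torch.full_like(delta, density))
        pruned.append(weight * delta * keep_mask / density)
    stacked = torch.stack(pruned)

    # TIES step: compute the elementwise majority sign across adapters ...
    majority_sign = stacked.sum(dim=0).sign()
    agrees = stacked.sign() == majority_sign
    # ... then disjoint merge: average only the entries that agree with the majority sign
    counts = agrees.sum(dim=0).clamp(min=1)
    return (stacked * agrees).sum(dim=0) / counts
```

Note that only the DARE step drops parameters according to density; the TIES-style sign mask can zero out further entries where adapters disagree, but it does not apply a second density-based pruning.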
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftConfig, PeftModel

config = PeftConfig.from_pretrained("smangrul/tinyllama_lora_norobots")
model = AutoModelForCausalLM.from_pretrained(config.base_model_name_or_path, load_in_4bit=True, device_map="auto")
tokenizer = AutoTokenizer.from_pretrained("smangrul/tinyllama_lora_norobots")

model = PeftModel.from_pretrained(model, "smangrul/tinyllama_lora_norobots", adapter_name="norobots")
_ = model.load_adapter("smangrul/tinyllama_lora_sql", adapter_name="sql")
_ = model.load_adapter("smangrul/tinyllama_lora_adcopy", adapter_name="adcopy")
@pacman100 @BenjaminBossan does it make sense to have copies of these checkpoints under the PEFT testing org?
Yes, let's try to put those artifacts on https://huggingface.co/peft-internal-testing.
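For context, after loading the adapters as in the snippet above, the guide merges them with add_weighted_adapter using the parameters from the first quoted snippet. A minimal sketch; the combination_type value here is just one of the supported methods, chosen for illustration:

```python
adapters = ["norobots", "adcopy", "sql"]
weights = [2.0, 0.3, 0.7]
adapter_name = "merge"
density = 0.2

# "dare_ties" is used here for illustration; the guide covers the other combination types as well
model.add_weighted_adapter(adapters, weights, adapter_name, combination_type="dare_ties", density=density)
model.set_adapter("merge")
```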
BenjaminBossan left a comment
Nicely done. On top of what has already been mentioned, I just have one comment about there actually being more than 2 methods. Otherwise, this LGTM.
* content
* code snippets
* api reference
* update
* feedback
* feedback
A guide to new model merging methods introduced in #1364.
todo:
* build_pr_documentation test (it should pass)