[Llama FA2] Re-add _expand_attention_mask and clean a couple things by patrickvonplaten · Pull Request #27074 · huggingface/transformers

patrickvonplaten · 2023-10-25T21:50:55Z

What does this PR do?

This PR cleans the attention mask converter a bit more, corrects some docstrings and removes outdated comments and deprecates _expand_attention_mask to fix optimum.

src/transformers/models/llama/modeling_llama.py

…mers into clean_llama

patrickvonplaten · 2023-10-25T22:28:18Z

@ArthurZucker could you give this a quick review? It'd make the Bart FA PR much easier to continue and should also fix the better transformers problem with optimum

HuggingFaceDocBuilderDev · 2023-10-25T22:39:41Z

The documentation is not available anymore as the PR was closed or merged.

ArthurZucker · 2023-10-26T07:56:44Z

Of course!

src/transformers/models/llama/modeling_llama.py

ArthurZucker · 2023-10-26T08:00:59Z

src/transformers/models/llama/modeling_llama.py

+def _expand_mask(mask: torch.Tensor, dtype: torch.dtype, tgt_len: Optional[int] = None):
+    warnings.warn(
+        "Calling `transformers.models.llama.modeling_llama._expand_mask` is deprecated and will be removed in v4.37. Use `transformers.models.llama.modeling_llama.AttnMaskConverter._expand_mask"
+    )
+    return AttnMaskConverter._expand_mask(mask=mask, dtype=dtype, tgt_len=tgt_len)


Nice! We should probably do the same for falcon and mistral as well

Think in optimum only the llama mask utils are imported: https://github.com/huggingface/optimum/blob/313e1bd0de2b44aaa71797464f1e8b6a041a6f18/optimum/bettertransformer/models/attention.py#L25

ok 👍🏻

src/transformers/models/llama/modeling_llama.py

…uggingface#27074) * clean * clean llama * fix more * make style * Apply suggestions from code review * Apply suggestions from code review * Update src/transformers/models/llama/modeling_llama.py * Update src/transformers/models/llama/modeling_llama.py * Apply suggestions from code review * finish * make style

patrickvonplaten added 4 commits October 25, 2023 23:37

clean

ed155a4

clean llama

b3d8ad4

fix more

61a9d85

make style

beb1060

patrickvonplaten commented Oct 25, 2023

View reviewed changes

src/transformers/models/llama/modeling_llama.py Show resolved Hide resolved

patrickvonplaten commented Oct 25, 2023

View reviewed changes

src/transformers/models/llama/modeling_llama.py Outdated Show resolved Hide resolved

Apply suggestions from code review

cfb1fc8

patrickvonplaten commented Oct 25, 2023

View reviewed changes

src/transformers/models/llama/modeling_llama.py Outdated Show resolved Hide resolved

Apply suggestions from code review

58e2bd4

patrickvonplaten commented Oct 25, 2023

View reviewed changes

src/transformers/models/llama/modeling_llama.py Outdated Show resolved Hide resolved

Update src/transformers/models/llama/modeling_llama.py

a38c96d

patrickvonplaten commented Oct 25, 2023

View reviewed changes

src/transformers/models/llama/modeling_llama.py Outdated Show resolved Hide resolved

patrickvonplaten added 2 commits October 26, 2023 00:09

Update src/transformers/models/llama/modeling_llama.py

fdbc754

Merge branch 'clean_llama' of https://github.com/huggingface/transfor…

867aa24

…mers into clean_llama

patrickvonplaten changed the title ~~clean~~ [Llama FA2] Re-add _expand_attention_mask and clean a couple things Oct 25, 2023

patrickvonplaten mentioned this pull request Oct 25, 2023

ImportError using Llama 2 with BetterTransformer huggingface/optimum#1481

Closed

4 tasks

patrickvonplaten requested a review from ArthurZucker October 25, 2023 22:27

ArthurZucker approved these changes Oct 26, 2023

View reviewed changes

patrickvonplaten commented Oct 26, 2023

View reviewed changes

src/transformers/models/llama/modeling_llama.py Outdated Show resolved Hide resolved

patrickvonplaten commented Oct 26, 2023

View reviewed changes

src/transformers/models/llama/modeling_llama.py Outdated Show resolved Hide resolved

patrickvonplaten added 3 commits October 26, 2023 12:14

Apply suggestions from code review

3882d35

finish

4e87b9b

make style

76e2b3f

patrickvonplaten merged commit d7cb5e1 into main Oct 26, 2023

patrickvonplaten deleted the clean_llama branch October 26, 2023 11:06

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Llama FA2] Re-add _expand_attention_mask and clean a couple things#27074

[Llama FA2] Re-add _expand_attention_mask and clean a couple things#27074
patrickvonplaten merged 12 commits intomainfrom
clean_llama

patrickvonplaten commented Oct 25, 2023 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

patrickvonplaten commented Oct 25, 2023 •

edited

Loading

Uh oh!

HuggingFaceDocBuilderDev commented Oct 25, 2023 •

edited

Loading

Uh oh!

ArthurZucker commented Oct 26, 2023

Uh oh!

Uh oh!

ArthurZucker Oct 26, 2023

Uh oh!

patrickvonplaten Oct 26, 2023

Uh oh!

ArthurZucker Oct 26, 2023

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

patrickvonplaten commented Oct 25, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

patrickvonplaten commented Oct 25, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

HuggingFaceDocBuilderDev commented Oct 25, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ArthurZucker commented Oct 26, 2023

Uh oh!

Uh oh!

ArthurZucker Oct 26, 2023

Choose a reason for hiding this comment

Uh oh!

patrickvonplaten Oct 26, 2023

Choose a reason for hiding this comment

Uh oh!

ArthurZucker Oct 26, 2023

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

patrickvonplaten commented Oct 25, 2023 •

edited

Loading

patrickvonplaten commented Oct 25, 2023 •

edited

Loading

HuggingFaceDocBuilderDev commented Oct 25, 2023 •

edited

Loading