[`core`] fix silent bug `keep_in_fp32` modules by younesbelkada · Pull Request #26589 · huggingface/transformers

younesbelkada · 2023-10-04T12:50:07Z

What does this PR do?

Same PR as #26484 but without any extra diff

Before this PR we performed a simple `module_name in key` substring check, which led to some modules being silently converted to fp32.

For example, instructblip models had their `word_embedding` layers converted to fp32 because `_keep_in_fp32_modules` includes "wo", which is a substring of `word_embedding`. The fix is to check `module_name in key.split(".")` instead.
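To make the difference concrete, here is a minimal sketch of the two checks side by side. The helper names and the two parameter keys are hypothetical and chosen only for illustration; they are not copied from the transformers code base.

```python
def matches_by_substring(module_name: str, key: str) -> bool:
    """Check described as the old behaviour: plain substring test on the key."""
    return module_name in key


def matches_by_component(module_name: str, key: str) -> bool:
    """Check described as the fix: match a whole dot-separated component."""
    return module_name in key.split(".")


# Hypothetical parameter keys (assumptions, for illustration only).
embedding_key = "language_model.word_embedding.weight"
wo_key = "language_model.block.0.layer.1.wo.weight"

# "wo" is a substring of "word_embedding", so the substring test matches it.
print(matches_by_substring("wo", embedding_key))   # True
# No component of the key is exactly "wo", so the component test does not.
print(matches_by_component("wo", embedding_key))   # False
# A real "wo" component is matched by both checks.
print(matches_by_substring("wo", wo_key))          # True
print(matches_by_component("wo", wo_key))          # True
```

The component check is stricter by design: a short entry such as "wo" only selects modules whose name is exactly "wo" at some level of the dotted path.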

I can confirm that with this PR the failing instructblip tests now pass.

HuggingFaceDocBuilderDev · 2023-10-04T13:06:37Z

The documentation is not available anymore as the PR was closed or merged.

younesbelkada · 2023-10-04T14:47:04Z

Before merging this PR, I want to test it on T5 models and run the 8-bit tests, as this change might affect them.

younesbelkada · 2023-10-04T14:47:28Z

tests/models/instructblip/test_modeling_instructblip.py

        model = InstructBlipForConditionalGeneration.from_pretrained(
            "Salesforce/instructblip-flan-t5-xl",
            torch_dtype=torch.bfloat16,
+            low_cpu_mem_usage=True,


This is just to make the loading of the model faster

younesbelkada · 2023-10-04T15:07:34Z

Relevant T5 and bnb tests are passing, this PR is ready for review!

younesbelkada · 2023-10-04T15:10:02Z

... and tested the failing instructblip tests on the latest docker image and they pass with these changes

LysandreJik · 2023-10-05T08:46:40Z

Thanks for the PR! Do we have a common test that could have failed in this specific instance? If not, would it be possible to work on one?

I'm a bit afraid of the repercussions of such a change without a test that ensures that the modules that should be kept in fp32 actually are, and that those that shouldn't are kept in their original dtype. It fixes a silent error, but it also seems like it could break some silent successes.
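The property being asked for can be sketched without loading a model. The snippet below is only an illustration under stated assumptions, not the common test that was later added to this PR: `find_dtype_violations` is a hypothetical helper, dtypes are represented as plain strings rather than `torch` dtypes, and the parameter names are made up.

```python
def find_dtype_violations(param_dtypes, keep_in_fp32_modules, load_dtype):
    """Return (name, actual, expected) for every parameter with the wrong dtype.

    param_dtypes: mapping of parameter name -> dtype string (assumed format).
    keep_in_fp32_modules: module names that must stay in float32.
    load_dtype: dtype string every other parameter is expected to have.
    """
    violations = []
    for name, dtype in param_dtypes.items():
        components = name.split(".")
        # A parameter belongs to a kept module only on an exact component match.
        should_be_fp32 = any(m in components for m in keep_in_fp32_modules)
        expected = "float32" if should_be_fp32 else load_dtype
        if dtype != expected:
            violations.append((name, dtype, expected))
    return violations


# Hypothetical state after a correct half-precision load.
good = {
    "block.0.layer.1.wo.weight": "float32",
    "shared.word_embedding.weight": "float16",
}
# Hypothetical state showing the silent bug: the embedding was upcast too.
bad = {
    "block.0.layer.1.wo.weight": "float32",
    "shared.word_embedding.weight": "float32",
}

print(find_dtype_violations(good, ["wo"], "float16"))
print(find_dtype_violations(bad, ["wo"], "float16"))
```

Checking both directions in one pass covers the two failure modes raised here: a module that should be in fp32 but is not, and a module that was upcast when it should have kept its original dtype.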

younesbelkada · 2023-10-05T08:47:49Z

Hi @LysandreJik - OK makes sense, I am happy to work on a common test for that - I'll ping you once this is done

younesbelkada · 2023-10-05T10:04:49Z

Tests are passing, this is ready for another review!

LysandreJik

Looks great! Thanks @younesbelkada

fix silent bug keep_in_fp32 modules

a1bd493

final fix

212576b

younesbelkada marked this pull request as ready for review October 4, 2023 14:45

younesbelkada commented Oct 4, 2023

younesbelkada requested review from LysandreJik and ydshieh October 4, 2023 14:47

younesbelkada added 2 commits October 5, 2023 09:20

added a common test.

e4615b7

Merge remote-tracking branch 'upstream/main' into fix-final-instructblip

18250f1

younesbelkada added 2 commits October 5, 2023 10:51

Trigger CI

3fad4d5

revert

2dc3b25

LysandreJik approved these changes Oct 5, 2023

younesbelkada merged commit e6d250e into huggingface:main Oct 5, 2023

younesbelkada deleted the fix-final-instructblip branch October 5, 2023 12:44

younesbelkada mentioned this pull request Oct 11, 2023

Avoid class attribute _keep_in_fp32_modules being modified #26433

Closed

younesbelkada merged 6 commits into huggingface:main from younesbelkada:fix-final-instructblip
