
[tests] Stricter generate + compilation test -- no recompilations allowed#37629

Merged
gante merged 5 commits into huggingface:main from gante:compilation_tests_no_recompilation
Apr 22, 2025

Conversation


@gante gante commented Apr 19, 2025

What does this PR do?

Follow-up to #37447

This PR upgrades test_generate_compile_model_forward to catch recompilation issues. This is done by (a) activating recompilation logs and (b) failing the test if recompilation messages appear in those logs. The improved test would have failed with the changes that broke gemma 2/3 + compile 🤗

In the process, a few extra things were standardized in the test, and a few skips were removed.
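The test pattern described above — enable recompilation logging, run generation, then fail if any recompile message was captured — can be sketched in a stdlib-only form. This is a minimal illustration of the pattern, not the actual test: the logger name and message text here are hypothetical stand-ins (in recent PyTorch versions, recompilation logging can be enabled via facilities such as `torch._logging.set_logs(recompiles=True)`).

```python
import logging

def run_with_recompile_check(fn, logger_name="compile.recompiles"):
    """Run `fn` while capturing log records from `logger_name`,
    then fail if any captured record mentions a recompilation."""
    records = []

    class _Capture(logging.Handler):
        def emit(self, record):
            records.append(record.getMessage())

    logger = logging.getLogger(logger_name)
    handler = _Capture()
    old_level = logger.level
    logger.addHandler(handler)
    logger.setLevel(logging.DEBUG)  # ensure recompile messages are emitted
    try:
        result = fn()
    finally:
        logger.removeHandler(handler)
        logger.setLevel(old_level)

    recompiles = [m for m in records if "Recompiling" in m]
    assert not recompiles, f"Unexpected recompilations: {recompiles}"
    return result
```

The key point of the stricter test is the final assertion: a compiled forward that silently recompiles on later calls still "passes" a plain correctness check, so the logs are the only place the regression shows up.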

@gante gante requested a review from zucchini-nlp April 19, 2025 13:24
@github-actions

Hi 👋, thank you for opening this pull request! The pull request is converted to draft by default. The CI will be paused while the PR is in draft mode. When it is ready for review, please click the Ready for review button (at the bottom of the PR page). This will assign reviewers and trigger CI.

@github-actions github-actions bot marked this pull request as draft April 19, 2025 13:24
@gante gante requested a review from Cyrilvallez April 19, 2025 13:24
@gante gante marked this pull request as ready for review April 19, 2025 13:25
else:
# if the 4D causal mask exists, it should be present in the base model (XXXModel class) or in its decoder.
base_model = getattr(self, self.base_model_prefix, self)
decoder = base_model.get_decoder() if hasattr(base_model, "get_decoder") else None
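The lookup in the diff above follows a simple pattern: resolve the base model via `base_model_prefix` (falling back to the model itself), then probe it for a decoder. A toy sketch of that pattern, using hypothetical dummy classes in place of real transformers models:

```python
class DummyDecoder:
    """Stand-in for a decoder submodule."""

class DummyBaseModel:
    """Stand-in for an XXXModel class that owns the decoder."""
    def __init__(self):
        self.decoder = DummyDecoder()

    def get_decoder(self):
        return self.decoder

class DummyForCausalLM:
    """Stand-in for a head model wrapping the base model."""
    base_model_prefix = "model"

    def __init__(self):
        self.model = DummyBaseModel()

    def resolve_decoder(self):
        # same pattern as the diff above: fall back to `self` if the
        # base-model attribute is missing, then probe for `get_decoder`
        base_model = getattr(self, self.base_model_prefix, self)
        return base_model.get_decoder() if hasattr(base_model, "get_decoder") else None
```

Falling back to `self` in the `getattr` call is what makes the same code work for both head models (which wrap a base model) and bare base models.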
@gante (Contributor Author)

this fixes the compilation test for opt and helps with decoder-only whisper

@gante
Copy link
Contributor Author

gante commented Apr 19, 2025

cc @manueldeprada (since you mentioned you are interested in torch.compile)

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@zucchini-nlp (Member) left a comment

Thanks for adding the test! So this means all VLMs are actually compiling without any breaks, right? I am still wondering about the slowdowns we experienced when compiling certain models.

One question about the test: it guards for torch>=2.6. Are the runners installing the latest version? It would be nice to get this test running on each PR, to avoid accidentally breaking gemma again.

Comment on lines +2094 to +2096
# TODO (joao, raushan): do we need a custom `generate` in these models? can we call `super().generate`, as
# opposed to the inner model's `generate`? If yes, we would get a more standardized codebase
if "blip" in model.__class__.__name__.lower():
@zucchini-nlp (Member)

No, blip cannot be fixed anymore; we risk breaking it for existing hub repos 😢 I had a branch somewhere, but it didn't work

And it's not worth fixing; we can even skip compile tests for blip if it's making our life harder

@gante (Contributor Author)

haha okay I'll remove the TODO 😢 (I'm a bit sad on the inside)


gante commented Apr 21, 2025

@zucchini-nlp

> So this means all VLMs are actually compiling without any breaks, right? I am still wondering about the slowdowns we experienced when compiling certain models.

Yes, they are compilable without graph breaks. That by itself is useful: it means they can probably be exported with torch.export 💪 We'll have to profile to understand why we don't observe speedups as large as in LLMs 🔍 It may be due to the number of input tokens, or to suboptimal image processing ops, ...

> One question about the test: it guards for torch>=2.6. Are the runners installing the latest version? It would be nice to get this test running on each PR, to avoid accidentally breaking gemma again.

Yes, they are running torch 2.6 (the runners' images install the latest version e.g. here).

On this PR's ci/circleci: tests_torch:
[screenshot of the passing test run, 2025-04-21]

@zucchini-nlp (Member) left a comment

Great, happy to hear CI actually runs tests!

@gante gante merged commit 85665a4 into huggingface:main Apr 22, 2025
20 checks passed
@gante gante deleted the compilation_tests_no_recompilation branch April 22, 2025 10:12
@gante gante mentioned this pull request Apr 23, 2025
zucchini-nlp pushed a commit to zucchini-nlp/transformers that referenced this pull request May 14, 2025
…owed (huggingface#37629)

* tmp commit

* stricter compilation test

* trigger tests

* rm todo