Fix Dynamo `lru_cache` warnings during `torch.compile` (#13384)
sayakpaul merged 4 commits into huggingface:main

Conversation
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
Hi @sayakpaul. Would you please review this PR? Thanks!
```diff
         "_parallel_config": parallel_config,
     }
-    if is_torch_version(">=", "2.5.0"):
+    if _CAN_USE_FLEX_ATTN:
```
Is this a safe replacement? If so, could you elaborate further?
Added comments for it.
sayakpaul left a comment
Thanks for the PR. Left one comment.
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
Hi @sayakpaul. These failures are unrelated to this PR. They are caused by a missing key in peft==0.18.2.dev0's `_MOE_TARGET_MODULE_MAPPING` ('llava', 'qwen2_vl'), which is a pre-existing issue in the PEFT dev build. My changes only touch
sayakpaul left a comment
Thanks for the PR! Failing test is unrelated.
…3384)

* fix compile issue
* compile friendly
* add comments

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
What does this PR do?
Fixes Dynamo `lru_cache` warnings when using `torch.compile` on diffusion pipelines. Two changes:

- `attention_dispatch.py`: `dispatch_attention_fn` calls `is_torch_version(">=", "2.5.0")` at runtime, which is `@lru_cache`-wrapped. Replace it with the existing module-level constant `_CAN_USE_FLEX_ATTN` so Dynamo never traces into it.
- `torch_utils.py`: `lru_cache_unless_export` only bypasses `lru_cache` during `torch.export` (`is_exporting`). Add an `is_compiling` check so `torch.compile` also bypasses the cache wrapper.

Reproduce
Output before the fix:

Output after the fix: