[Flux] Optimize guidance creation in flux pipeline by moving it outside the loop by chengzeyi · Pull Request #9153 · huggingface/diffusers

chengzeyi · 2024-08-12T06:36:50Z

What does this PR do?

Optimize guidance creation in the Flux pipeline by moving it outside the loop and using torch.full() instead of torch.tensor().
This reduces the number of unnecessary implicit CUDA synchronizations caused by creating a device tensor from a list.
I observe a small performance gain (1%-2%) with this change.
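
To make the change concrete, here is a minimal sketch of the two patterns. It is not the pipeline code: the function names, `num_steps`, and the exact form of the "before" loop are assumptions for illustration, and it falls back to CPU when no GPU is present (where there is no synchronization cost to observe).

```python
import torch


def guidance_per_step(guidance_scale, batch_size, num_steps, device):
    """Assumed 'before' pattern: rebuild the tensor from a Python list each step."""
    per_step = []
    for _ in range(num_steps):
        # Building a device tensor from a list goes through host memory first.
        g = torch.tensor([guidance_scale], device=device, dtype=torch.float32)
        per_step.append(g.expand(batch_size))
    return per_step


def guidance_hoisted(guidance_scale, batch_size, num_steps, device):
    """'After' pattern: fill the tensor once on the device, reuse it every step."""
    g = torch.full([1], guidance_scale, device=device, dtype=torch.float32)
    g = g.expand(batch_size)
    return [g for _ in range(num_steps)]


device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
before = guidance_per_step(3.5, batch_size=2, num_steps=4, device=device)
after = guidance_hoisted(3.5, batch_size=2, num_steps=4, device=device)

# Both patterns hand the same values to every step.
assert all(torch.equal(a, b) for a, b in zip(before, after))
```

The values are identical in both cases; only the number of tensor constructions differs (one per step versus one in total).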

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline?
Did you read our philosophy doc (important for complex PRs)?
Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

sayakpaul · 2024-08-13T14:56:43Z

src/diffusers/pipelines/flux/pipeline_flux.py

+        # handle guidance
+        if self.transformer.config.guidance_embeds:
+            guidance = torch.full([1], guidance_scale, device=device, dtype=torch.float32)
+            guidance = guidance.expand(latents.shape[0])
+        else:
+            guidance = None


I like this!

sayakpaul

Nice, thank you!
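
For readers unfamiliar with the `expand` call in the reviewed diff, the following is a small sketch (run on CPU, with an arbitrary batch size of 4 standing in for `latents.shape[0]`) suggesting why expanding the one-element tensor is cheap: the result is a view over the same storage rather than a copy.

```python
import torch

# One-element tensor, as in the diff (3.5 is an arbitrary example scale).
guidance = torch.full([1], 3.5, dtype=torch.float32)

# expand() broadcasts the size-1 dimension without allocating new memory.
expanded = guidance.expand(4)

assert expanded.shape == (4,)
assert expanded.stride() == (0,)  # stride 0: every index reads the same element
assert expanded.data_ptr() == guidance.data_ptr()  # same underlying storage
```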

HuggingFaceDocBuilderDev · 2024-08-13T15:03:34Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

…de the loop (#9153) * optimize guidance creation in flux pipeline by moving it outside the loop * use torch.full instead of torch.tensor to create a tensor with a single value --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

chengzeyi added 2 commits August 12, 2024 06:29

5ce24d0 optimize guidance creation in flux pipeline by moving it outside the loop

e572f9d use torch.full instead of torch.tensor to create a tensor with a single value

chengzeyi changed the title ~~[Flux] optimize guidance creation in flux pipeline by moving it outside the loop~~ [Flux] Optimize guidance creation in flux pipeline by moving it outside the loop Aug 12, 2024

a-r-r-o-w requested a review from sayakpaul August 13, 2024 14:46

sayakpaul reviewed Aug 13, 2024

sayakpaul approved these changes Aug 13, 2024

Merge branch 'main' into optimize_flux

4e76315

sayakpaul requested a review from DN6 August 13, 2024 14:56

Gothos mentioned this pull request Aug 14, 2024

Add Flux inpainting and Flux Img2Img #9135

Merged

5 tasks

DN6 merged commit e649678 into huggingface:main Aug 16, 2024

DN6 merged 3 commits into huggingface:main from chengzeyi:optimize_flux
