further scheduler changes for invoke_quant: prologue low prec, (slightly) more aggressive fusion #145104
eellison wants to merge 13 commits into gh/eellison/752/base
Conversation
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/145104
Note: Links to docs will display an error until the docs builds have been completed.
⏳ No Failures, 1 Pending as of commit cf18367 with merge base 49082f9
This comment was automatically generated by Dr. CI and updates every 15 minutes.
@@ -3477,36 +3521,7 @@ def can_fuse(self, node1: BaseSchedulerNode, node2: BaseSchedulerNode) -> bool:
)
Maybe move the whole if block regarding prologue fusion to can_fuse_prologue to make can_fuse smaller.
"""
Heuristics to avoid benchmarking predictably slow prologue fusions
"""
# user opt into more aggressive prologue fusion, dont use heuristics
Does the user opt in by using invoke_quant?
torch/_inductor/graph.py (Outdated)
self.low_precision_codegen_ops = OrderedSet[str]()
# more aggressive prologue fusion
self.invoke_quant_ops = OrderedSet[str]()
Suggested change:
- self.low_precision_codegen_ops = OrderedSet[str]()
+ self.low_precision_codegen_ops : OrderedSet[str] = OrderedSet()
  # more aggressive prologue fusion
- self.invoke_quant_ops = OrderedSet[str]()
+ self.invoke_quant_ops : OrderedSet[str] = OrderedSet()
to avoid runtime calls to __getitem__.
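The point of the suggestion is that `OrderedSet[str]()` evaluates the subscript at every construction, while a local variable annotation is never evaluated at runtime (PEP 526). A small stand-alone demonstration, using a counting stand-in class rather than Inductor's actual `OrderedSet` (the `TrackedSet` name and the call-counting `__class_getitem__` are assumptions for illustration):

```python
from typing import Generic, TypeVar

T = TypeVar("T")
subscript_calls = []  # records each runtime evaluation of TrackedSet[...]


class TrackedSet(Generic[T]):
    """Stand-in for OrderedSet that counts __class_getitem__ calls."""

    def __class_getitem__(cls, item):
        # __class_getitem__ is implicitly a classmethod (PEP 560)
        subscript_calls.append(item)
        return cls  # simplified: stay callable after subscripting


def init_with_runtime_subscript():
    # TrackedSet[str] is evaluated here, calling __class_getitem__ each time
    return TrackedSet[str]()


def init_with_annotation():
    # local variable annotations are not evaluated at runtime (PEP 526),
    # so no __class_getitem__ call happens here
    s: TrackedSet[str] = TrackedSet()
    return s
```

Both forms give the type checker the same information; the annotated form just avoids the per-`__init__` subscript call.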
…ec, (slightly) more aggressive fusion" Respect invoke_quant low precision options; also, be more aggressive in attempting fusion. cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy yf225 chenyang78 kadeng muchulee8 ColinPeppler amjames desertfire chauhang aakhundov [ghstack-poisoned]
@pytorchbot merge

Merge started. Your change will be merged once all checks pass (ETA 0-4 Hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
The merge job was canceled or timed out. This most often happens if two merge requests were issued for the same PR, or if the merge job was waiting for more than 6 hours for tests to finish. In the latter case, please do not hesitate to reissue the merge command.
@pytorchbot merge -f "rocm test taking a while"

Merge started. Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
Stack from ghstack (oldest at bottom):
Respect invoke_quant low precision options; also, be more aggressive in attempting fusion.
cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @ColinPeppler @amjames @desertfire @chauhang @aakhundov