
[inductor] Make addcmul/addcdiv decomp skip unconditional and add another decomp #175839

Closed

mlazos wants to merge 16 commits into gh/mlazos/110/base from gh/mlazos/110/head

Conversation

@mlazos (Contributor) commented Feb 26, 2026

Stack from ghstack (oldest at bottom):

The FMA lowerings for addcmul/addcdiv are now unconditional (not gated by
emulate_precision_casts), but the decomposition skip in select_decomp_table()
was still gated by that config. This meant the decompositions would override
the FMA lowerings when emulate_precision_casts=False.

Make the decomp skip unconditional to match the lowerings. Also add
aten.addcmul_ (in-place) to the skip list.
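For context, aten.addcmul(input, t1, t2, value=v) computes input + v * t1 * t2, which Inductor can lower directly to a fused multiply-add. Below is a minimal sketch of the skip described above, assuming illustrative structure and names; the real select_decomp_table() in Inductor's decomposition module differs in detail:

```python
# Illustrative sketch, not the actual Inductor source: drop the
# addcmul/addcdiv decomps from the decomp table unconditionally so they
# never shadow the dedicated FMA lowerings. The skip-set contents follow
# the PR text; the function structure is an assumption.
import torch

aten = torch.ops.aten

DECOMP_SKIPS = {
    aten.addcmul.default,
    aten.addcmul_.default,  # in-place variant, newly added by this PR
    aten.addcdiv.default,
}

def select_decomp_table(decompositions):
    # Before this PR, this filtering only ran when
    # torch._inductor.config.emulate_precision_casts was True.
    return {op: d for op, d in decompositions.items() if op not in DECOMP_SKIPS}
```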

Authored with Claude.

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @kadeng @muchulee8 @amjames @chauhang @aakhundov @coconutruben @jataylo

@pytorch-bot (bot) commented Feb 26, 2026

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/175839

Note: Links to docs will display an error until the doc builds have completed.

✅ You can merge normally! (2 Unrelated Failures)

As of commit c721804 with merge base a6beff3:

BROKEN TRUNK - The following job failed but was already failing on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures
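One way to do that locally (illustrative; the remote name is assumed to be origin, and a ghstack-managed stack needs a ghstack re-submit afterwards):

```sh
git fetch origin viable/strict
git rebase origin/viable/strict
```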

UNSTABLE - The following job is marked as unstable, possibly due to flakiness on trunk:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@pytorch-bot (bot) commented Feb 26, 2026

This PR needs a release notes: label

If your changes are user facing and intended to be a part of release notes, please use a label starting with release notes:.

If not, please add the topic: not user facing label.

To add a label, comment on this PR with, for example:
@pytorchbot label "topic: not user facing"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

@mlazos changed the title from "[inductor] Make addcmul/addcdiv decomp skip unconditional" to "[inductor] Make addcmul/addcdiv decomp skip unconditional and add another decomp" on Feb 26, 2026
mlazos added a commit that referenced this pull request Feb 26, 2026
mlazos added a commit that referenced this pull request Feb 27, 2026
@mlazos added the ciflow/trunk (Trigger trunk jobs on your pull request) and release notes: inductor labels on Feb 28, 2026
mlazos added a commit that referenced this pull request Mar 2, 2026
mlazos added 3 commits March 2, 2026 15:56
@pytorchmergebot (Collaborator) commented:
Starting merge as part of PR stack under #174911

pytorchmergebot pushed a commit that referenced this pull request Mar 3, 2026
Add CompiledOptimizerBitwiseTests test suite that verifies compiled
optimizers produce bitwise identical results to eager when precision
configs are enabled:
- eager_numerics.division_rounding = True
- eager_numerics.pow_precision = True
- emulate_precision_casts = True

Tests cover Adam and AdamW with various configurations including
amsgrad, maximize, and weight_decay options.

Pull Request resolved: #174911
Approved by: https://github.com/v0i0
ghstack dependencies: #176237, #175839
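A hedged sketch of what one such bitwise check could look like; the actual suite's structure and helpers are not shown here, and whether the eager_numerics keys are patchable exactly this way is an assumption based on the config names in the commit message:

```python
import torch
from torch._inductor import config as inductor_config

@inductor_config.patch(
    {
        "emulate_precision_casts": True,
        "eager_numerics.division_rounding": True,  # named in the commit message
        "eager_numerics.pow_precision": True,      # named in the commit message
    }
)
def check_adam_bitwise(steps: int = 3) -> None:
    # Same initial parameter for both optimizers.
    torch.manual_seed(0)
    p_eager = torch.randn(8, requires_grad=True)
    p_comp = p_eager.detach().clone().requires_grad_(True)

    opt_eager = torch.optim.Adam([p_eager], lr=1e-3)
    opt_comp = torch.optim.Adam([p_comp], lr=1e-3)
    compiled_step = torch.compile(opt_comp.step)

    for _ in range(steps):
        g = torch.randn(8)
        p_eager.grad = g
        p_comp.grad = g.clone()
        opt_eager.step()
        compiled_step()

    # Exact equality, not torch.allclose: the point is bitwise agreement.
    assert torch.equal(p_eager, p_comp)
```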
postmath pushed a commit to postmath/pytorch that referenced this pull request Mar 3, 2026
postmath pushed a commit to postmath/pytorch that referenced this pull request Mar 3, 2026
sandy-gags pushed a commit to sandy-gags/pytorch that referenced this pull request Mar 12, 2026
EmanueleCoradin pushed a commit to EmanueleCoradin/pytorch that referenced this pull request Mar 30, 2026
EmanueleCoradin pushed a commit to EmanueleCoradin/pytorch that referenced this pull request Mar 30, 2026
@github-actions (bot) deleted the gh/mlazos/110/head branch April 3, 2026 02:26