[CUDA][cuBLASLt] Fix scale setting for allowFP16AccumulationCuBLAS true case #153083
eqy wants to merge 3 commits into pytorch:main
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/153083
Note: Links to docs will display an error until the docs builds have been completed.
✅ You can merge normally! (1 Unrelated Failure) As of commit 167c4a6 with merge base 590965f.
BROKEN TRUNK - The following job failed but was present on the merge base. 👉 Rebase onto the `viable/strict` branch to avoid these failures.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
cudaDeviceProp* prop = at::cuda::getCurrentDeviceProperties();
if (prop->major >= 7 && at::globalContext().allowFP16AccumulationCuBLAS()) {
  computeType = CUBLAS_COMPUTE_16F;
  scaleType = CUDA_R_16F;
}
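The diff above pairs `CUBLAS_COMPUTE_16F` with `CUDA_R_16F` because cuBLASLt derives the in-memory type of the alpha/beta scaling factors from the scale type, not from the matrix types. A byte-level sketch (illustrative only, not the cuBLASLt API; the `half_to_float` decoder below is a hand-rolled helper) of what goes wrong when the caller stores alpha as a 32-bit float but the library, per a half-precision scale type, reads only 16 bits:

```python
import struct

def half_to_float(h):
    # Decode a 16-bit IEEE-754 half (given as an int) into a Python float.
    sign = (h >> 15) & 1
    exp = (h >> 10) & 0x1F
    frac = h & 0x3FF
    if exp == 0:                       # subnormal or zero
        val = frac / 1024 * 2 ** -14
    elif exp == 31:                    # inf / NaN
        val = float("inf") if frac == 0 else float("nan")
    else:
        val = (1 + frac / 1024) * 2 ** (exp - 15)
    return -val if sign else val

# Caller writes alpha = 1.0 as a float32 (4 bytes, little-endian) ...
alpha_f32 = struct.pack("<f", 1.0)
# ... but a library expecting a half-precision scale reads only 2 bytes.
misread = struct.unpack("<H", alpha_f32[:2])[0]

print(half_to_float(misread))  # 0.0 -- the low 16 bits of float32 1.0 are zero
```

With the mismatched scale type the effective alpha becomes 0.0, silently zeroing the GEMM output; setting `scaleType = CUDA_R_16F` alongside `CUBLAS_COMPUTE_16F` keeps the two in agreement.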
@pytorchmergebot merge

Merge started. Your change will be merged once all checks pass (ETA 0-4 Hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.

Merge failed. Reason: Command. Details for Dev Infra team: Raised by workflow job.

@pytorchmergebot merge

Merge started. Your change will be merged once all checks pass (ETA 0-4 Hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
Also adds some missing @onlyCUDA / support-check decorators in test_matmul_cuda.py. Should help resolve #151890.
cc @ptrblck @msaroufim @jerryzh168 @csarofeen @xwang233