[MPS] Add cholesky_solve support by Kingwl · Pull Request #176703 · pytorch/pytorch

Kingwl · 2026-03-06T09:05:58Z

As decomposition via two triangular solves

Frequently requested op in #154052

pytorch-bot · 2026-03-06T09:06:01Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/176703

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure

As of commit 2738fc0 with merge base 08b6f48 ():

NEW FAILURE - The following job has failed:

pull / linux-jammy-py3.14t-clang15 / test (crossref, 2, 2, lf.linux.2xlarge) (gh)
##[error]Response status code does not indicate success: 401 (Unauthorized).

This comment was automatically generated by Dr. CI and updates every 15 minutes.

linux-foundation-easycla · 2026-03-06T09:06:08Z

The committers listed above are authorized under a signed CLA.

✅ login: Kingwl / name: Wenlu Wang (2738fc0)

github-actions · 2026-03-06T09:09:54Z

Attention! native_functions.yaml was changed

If you are adding a new function or defaulted argument to native_functions.yaml, you cannot use it from pre-existing Python frontend code until our FC window passes (two weeks). Split your PR into two PRs, one which adds the new C++ functionality, and one that makes use of it from Python, and land them two weeks apart. See https://github.com/pytorch/pytorch/wiki/PyTorch's-Python-Frontend-Backward-and-Forward-Compatibility-Policy#forwards-compatibility-fc for more info.

Caused by:

aten/src/ATen/native/native_functions.yaml

test/test_mps.py

aten/src/ATen/native/mps/operations/LinearAlgebra.mm

pytorch-bot · 2026-03-11T00:48:51Z

To add the ciflow label ciflow/mps please first approve the workflows that are awaiting approval (scroll to the bottom of this page).

This helps ensure we don't trigger CI on this PR until it is actually authorized to do so. Please ping one of the reviewers if you do not have access to approve and run workflows.

aten/src/ATen/native/mps/operations/LinearAlgebra.mm

kurtamohler

Thank you for the PR! Left a few suggestions

kurtamohler · 2026-03-12T01:01:34Z

@pytorchbot rebase

pytorchmergebot · 2026-03-12T01:03:19Z

@pytorchbot started a rebase job onto refs/remotes/origin/viable/strict. Check the current status here

pytorchmergebot · 2026-03-12T01:03:22Z

Successfully rebased feat/support-cholesky-solve-for-mps onto refs/remotes/origin/viable/strict, please pull locally before adding more changes (for example, via git checkout feat/support-cholesky-solve-for-mps && git pull --rebase)

kurtamohler · 2026-03-12T01:04:36Z

@pytorchbot label "ciflow/mps"

malfet · 2026-03-12T01:06:03Z

@pytorchbot merge

pytorchmergebot · 2026-03-12T01:08:10Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

Kingwl · 2026-03-12T01:08:10Z

BTW: the solve triangular extremely slow on MPS. And It have no Idea how to be it faster after do some investigation. Any suggestion？😿

pytorchmergebot · 2026-03-12T02:42:38Z

Merge failed

Reason: 1 jobs have failed, first few of them are: trunk / macos-py3-arm64 / build

Details for Dev Infra team

Raised by workflow job

kurtamohler · 2026-03-12T03:41:16Z

BTW: the solve triangular extremely slow on MPS. And It have no Idea how to be it faster after do some investigation. Any suggestion？😿

Is it slow just for batched inputs with more than 2 dims, or is it also slow for non-batched inputs?

It seems that if the input is batched, each matrix is computed serially:

pytorch/aten/src/ATen/native/mps/operations/LinearAlgebra.mm

Line 1246 in f72a552

for (const auto i : c10::irange(batchSize)) {

So a major improvement would be to parallelize the batches. It looks like MPSMatrixSolveTriangular does not support batched matrices, so it would have to be implemented as a custom metal kernel.

malfet · 2026-03-12T14:57:08Z

@pytorchbot merge -i

pytorchmergebot · 2026-03-12T14:59:31Z

Merge started

Your change will be merged while ignoring the following 1 checks: pull / linux-jammy-py3.14t-clang15 / test (crossref, 2, 2, lf.linux.2xlarge)

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

As decomposition via two triangular solves Frequently requested op in pytorch#154052 Pull Request resolved: pytorch#176703 Approved by: https://github.com/kurtamohler, https://github.com/malfet

Kingwl requested review from malfet and mruberry as code owners March 6, 2026 09:05

pytorch-bot bot added the release notes: mps Release notes category label Mar 6, 2026

pytorchbot added the open source label Mar 6, 2026

malfet added the ciflow/mps Run MPS tests (subset of trunk) label Mar 6, 2026

pytorch-bot bot removed the ciflow/mps Run MPS tests (subset of trunk) label Mar 6, 2026

Kingwl force-pushed the feat/support-cholesky-solve-for-mps branch 2 times, most recently from 458a6b7 to e5e17f7 Compare March 7, 2026 09:40