Conversation
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/146799
Note: Links to docs will display an error until the docs builds have been completed. ✅ No Failures as of commit 4ea9ce5 with merge base 91c4bf3. This comment was automatically generated by Dr. CI and updates every 15 minutes.
Attention! native_functions.yaml was changed. If you are adding a new function or defaulted argument to native_functions.yaml, you cannot use it from pre-existing Python frontend code until our FC window passes (two weeks). Split your PR into two PRs: one that adds the new C++ functionality, and one that makes use of it from Python, and land them two weeks apart. See https://github.com/pytorch/pytorch/wiki/PyTorch's-Python-Frontend-Backward-and-Forward-Compatibility-Policy#forwards-compatibility-fc for more info.
To add the ciflow label: this helps ensure we don't trigger CI on this PR until it is actually authorized to do so. Please ping one of the reviewers if you do not have access to approve and run workflows.
out.tril_();
upper ? out.transpose_(ndim - 2, ndim - 1) : out;
This will silently alter the stride structure of out if upper == true. It would be better as upper ? out.triu_() : out.tril_().
That's not the same. The kernel does the decomposition in the lower part of the matrix. If you do out.triu_() instead of out.tril_() -> transpose, then you get the upper part of the matrix, which isn't really the correct output.
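A quick CPU sketch of this point (illustration only, not the MPS kernel itself): the upper factor is the transpose of the lower factor, whereas triu of the lower factor keeps only its diagonal.

```python
import torch

# Build a symmetric positive-definite matrix
A = torch.rand(4, 4)
A = A @ A.T + 4 * torch.eye(4)

L = torch.linalg.cholesky(A)              # lower factor
U = torch.linalg.cholesky(A, upper=True)  # upper factor

# The upper factor is the transpose of the lower one ...
torch.testing.assert_close(U, L.mT)

# ... while triu of the lower factor keeps only its diagonal, not U
assert not torch.allclose(L.triu(), U)
```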
Do you have some stride assumptions in the kernel, or is it stride-agnostic? If it is stride-agnostic, then the kernel could be run on the transposed variant.
It assumes that the input is row-major (contiguous).
out can be provided externally as column-major. What would happen in this case?
I printed the data pointer inside the MPS function and outside in Python:
import torch
out = torch.rand(3, 3, 3, device="mps").permute(2, 1, 0)
x = torch.rand(3, 3, 3, device="mps")
x = x.mT @ x
data_ptr = out.data_ptr()
print(f"0x{data_ptr:x}") # lowercase hex
torch.linalg.cholesky(x, out=out)
print(f"0x{out.data_ptr():x}")
Yields:
0x10a4d68d0
0x10fb19150
0x10a4d68d0
The first one is printed from Python, the second one from C++ before launching the kernel, and the third one again from Python. So yeah, confirmed.
As per https://github.com/pytorch/pytorch/pull/146799/files#r1952464144, this is expected. Sorry for the confusion. But we should have issues when out is contiguous and upper=True it seems.
No issues from what I checked:
import torch
out = torch.rand(3, 3, 3, device="mps").permute(2, 1, 0)
x = torch.rand(3, 3, 3, device="mps")
x = x.mT @ x
data_ptr = out.data_ptr()
print(f"0x{data_ptr:x}") # lowercase hex
print(out.stride())
print(out.is_contiguous())
res1 = torch.linalg.cholesky(x, out=out, upper=True)
res2 = torch.linalg.cholesky(x.cpu(), out=out.cpu(), upper=True)
print(f"0x{out.data_ptr():x}")
print(out.stride())
torch.testing.assert_close(res1.cpu(), res2)
0x113f70cc0
(1, 3, 9)
False
0x114f3a510
0x113f70cc0
(1, 3, 9)
@Isalia20 , could you remove permute so that out is contiguous? In the Meta function, as per your modification, out is re-used only if it is contiguous.
Ah I see the issue now:
0x10bc7b840
(9, 3, 1)
True
0x10bc7b840
0x10bc7b840
(9, 1, 3)
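A minimal CPU repro of the stride flip shown above (shapes assumed to match the batched 3×3 case): the in-place transpose_ changes the strides the caller observes, without touching the data.

```python
import torch

out = torch.empty(3, 3, 3)          # contiguous batched 3x3 buffer
assert out.stride() == (9, 3, 1)    # row-major (C-contiguous)

# what `upper ? out.transpose_(ndim - 2, ndim - 1) : out` amounts to:
out.transpose_(-2, -1)              # in place, no data copy

assert out.stride() == (9, 1, 3)    # the caller now sees flipped strides
assert not out.is_contiguous()
```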
// L
- auto L_strides = at::native::batched_matrix_contiguous_strides(A_shape, /*f-contig*=*/true);
+ auto L_strides = at::native::batched_matrix_contiguous_strides(A_shape, /*f-contig*=*/A.device().type() != at::kMPS);
The MPS kernel assumes a row-major layout for the matrix where it does the decomposition.
Can the kernel be made row-major/col-major agnostic so as to preserve consistency across backends?
I'll take a look ~next week to see if I can make it work for col-major, so we don't need to make it row-major for MPS only. But why do we want to preserve consistency across backends? A lot of ops on MPS use a row-major layout and require a contiguous call before the tensor is passed to an MPS kernel.
In linalg, LAPACK seems to be the source of truth, and it is written in Fortran, where col-major is the standard layout :(
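The two layouts differ only in strides. A small CPU check (illustrative; the printed stride is what the CPU LAPACK path is expected to produce, not a guarantee):

```python
import torch

n = 4
c = torch.empty(n, n)         # C-contiguous (row-major): strides (n, 1)
f = torch.empty(n, n).mT      # F-contiguous view (col-major): strides (1, n)
assert c.stride() == (n, 1)
assert f.stride() == (1, n)

# On CPU, cholesky goes through LAPACK; the factor reconstructs the
# input regardless of its memory layout
A = torch.eye(n) * 4.0
L = torch.linalg.cholesky(A)
print(L.stride())             # column-major strides expected on the LAPACK path
torch.testing.assert_close(L @ L.mT, A)
```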
I believe we can re-use the kernel without that much code change (i.e. no need to make it stride-agnostic for now). In the Meta function we request C-contiguous when upper=False and F-contiguous when upper=True for MPS. Then we only need to remove the line upper ? out.transpose_(...) : out (and probably replace it with upper ? out.triu_() : out.tril_()), or something along these lines. This should resolve the issue with out for now, before the kernel is adapted for better memory accesses in column-major mode...
I've tried it, but I'm afraid it doesn't work. I'll address this in a follow-up PR with the kernel change for column-major mode, rather than going down the rabbit hole now for a temporary fix.
Thanks, I'll address the comments a little later today.
output_cpu = torch.linalg.cholesky_ex(input_cpu, upper=upper)
output_mps = torch.linalg.cholesky_ex(input_mps, upper=upper)
Let us also check that info is the same since its behavior is altered?
output_cpu and output_mps are tuples of L and info tensors, so assertEqual compares both of them. Do you mean adding a separate test where info might be >1?
Yes, when erroring on non-psd inputs :)
I'll do it a bit later today and also adapt the error message
Added better error message
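A sketch of such a test (CPU shown for illustration; the matrix below is just a hypothetical non-PSD input): cholesky_ex reports the failure through info rather than raising, unless check_errors=True.

```python
import torch

A = torch.eye(3)
A[2, 2] = -1.0                        # not positive-definite

L, info = torch.linalg.cholesky_ex(A)
assert info.item() > 0                # LAPACK convention: the order of the
                                      # leading minor that is not positive-definite

# With check_errors=True, an error is raised instead
try:
    torch.linalg.cholesky_ex(A, check_errors=True)
except torch.linalg.LinAlgError as e:
    print("raised:", e)
```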
@pytorchbot merge -f "MPS is green"
Merge started. Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
PR #145701 didn't have the experimental version of cholesky. This PR adds that version.