Lower aten::_linalg_eigh #7674
Conversation
cc @vanbasten23
    if (!compute_v) {
      // Fallback to aten in case of `eigvalsh`, which does not compute
      // eigenvectors but requires numerically stable gradients.
      return at::native::call_fallback_fn<&xla_fallback,
I understand we only need to lower torch.linalg.eigh. But in case we need to lower eigvalsh later, would the "requires numerically stable gradients" note be a blocker that forces us to fall back to aten?
So I read the PyTorch docs again and I think I misunderstood them initially. What the PyTorch doc suggests is that the gradients of the eigenvectors are unstable. Therefore, if the user calls eigvalsh, they only get eigenvalues, and the gradients will be stable. I removed this misleading comment.
If we want to support eigvalsh later, the simplest way is probably to discard the eigenvectors from XLA and figure out what to return for the second at::Tensor tuple member.
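If eigvalsh were supported that way, the intended semantics are simply the full decomposition with the eigenvectors discarded. A NumPy sketch of that behavior (not the actual C++ lowering):

```python
import numpy as np

# Symmetric (self-adjoint) input.
A = np.array([[2.0, 1.0],
              [1.0, 2.0]])

# eigh returns both eigenvalues and eigenvectors ...
w, v = np.linalg.eigh(A)
# ... while eigvalsh is the same decomposition with the vectors discarded.
w_only = np.linalg.eigvalsh(A)

assert np.allclose(w, w_only)  # eigenvalues agree: [1.0, 3.0]
```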
    std::array<xla::XlaOp, 2> LowerImpl(xla::XlaOp input, bool lower) {
      auto [eigenvectors, eigenvalues] =
          xla::SelfAdjointEig(input, lower, /* max_iter */ 64, /* tol */ 1e-6);
What are the (opaque) max_iter and tol values used for?
Also, could you add a comment on why the default values were changed?
When testing I discovered that the default settings lead to very low accuracy in the reconstructed matrix (e.g. if we decompose A into A' = V @ Q @ V_T, then A and A' differ by more than 0.1 in some elements).
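The reconstruction check described above can be expressed as follows (a NumPy sketch of the accuracy test, not the PR's actual code): with a well-converged solver, the element-wise error is far below the 0.1 observed with the default XLA settings.

```python
import numpy as np

rng = np.random.default_rng(0)
B = rng.standard_normal((4, 4))
A = B + B.T                      # symmetrize to get a self-adjoint matrix

w, v = np.linalg.eigh(A)         # w: eigenvalues, v: eigenvectors
A_rec = v @ np.diag(w) @ v.T     # reconstruct A' = V @ diag(w) @ V^T

# A converged decomposition reconstructs A to near machine precision.
assert np.max(np.abs(A - A_rec)) < 1e-10
```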
After looking at what JAX does, I think it is simpler to align with JAX: https://github.com/google/jax/blob/a8b425cac50c842f66f36903dfb93fe6ad5a2a5b/jax/_src/lax/linalg.py#L726. It looks like they use the same tolerance but a higher max_iter.
There's an xla::SelfAdjointEig function, so we lower to that. I discovered that the XLA implementation of eigenvalue decomposition is not as numerically stable as NumPy or PyTorch, even with a small tolerance and a large max_iter. The unit test therefore uses a hardcoded tensor value copied from https://android.googlesource.com/platform/external/tensorflow/+/f2a058296dd/tensorflow/compiler/xla/client/lib/self_adjoint_eig_test.cc#149
Fixes #6017
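A hardcoded-input test in this spirit might look like the following sketch. The matrix and expected values here are illustrative only, chosen because the eigenvalues are known in closed form; they are not the tensor copied from the XLA test file.

```python
import numpy as np

# Hypothetical fixed symmetric input (NOT the tensor from the XLA test)
# with closed-form eigenvalues 4 - sqrt(2), 4, 4 + sqrt(2).
A = np.array([[4.0, 1.0, 0.0],
              [1.0, 4.0, 1.0],
              [0.0, 1.0, 4.0]])

expected = np.array([4.0 - np.sqrt(2.0), 4.0, 4.0 + np.sqrt(2.0)])
w = np.linalg.eigvalsh(A)        # eigenvalues in ascending order

# A loose tolerance mirrors the 1e-6 `tol` passed to xla::SelfAdjointEig.
assert np.allclose(w, expected, atol=1e-6)
```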