Add linalg.lu #67833

Closed
lezcano wants to merge 83 commits into gh/Lezcano/28/base from gh/Lezcano/28/head

Conversation

@lezcano
Collaborator

@lezcano lezcano commented Nov 4, 2021

Stack from ghstack:

This PR modifies lu_unpack by:

  • Using less memory when unpacking L and U
  • Fusing the subtraction by -1 with unpack_pivots_stub
  • Defining tensors of the correct types to avoid copies
  • Porting lu_unpack to be a structured kernel so that its _out version does not incur extra copies

Then we implement linalg.lu as a structured kernel, since we want to
compute its derivative manually: composing the derivatives of
torch.lu_factor and torch.lu_unpack would be less efficient.
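
To make the relationship concrete, here is a minimal sketch (assuming the
final torch.linalg.lu API that returns P, L, U; the exact names and calls
below are illustrative, not lifted from this PR's tests):

```python
import torch

A = torch.randn(3, 4, 4)  # a batch of square matrices

# New fused path: one dedicated kernel with a hand-written derivative.
P, L, U = torch.linalg.lu(A)

# The equivalent composition. Differentiating through it would also route
# the backward through lu_unpack's derivative, which is what we avoid.
LU, pivots = torch.linalg.lu_factor(A)
P2, L2, U2 = torch.lu_unpack(LU, pivots)

torch.testing.assert_close(P @ L @ U, A)     # A == P L U up to rounding
torch.testing.assert_close(P2 @ L2 @ U2, A)
```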

This new function and lu_unpack come with everything they can: forward and
backward AD, decent docs, correctness tests, an OpInfo, complex support,
support for meta tensors, and support for vmap and vmap over the gradients.
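
For instance, a correctness check for the new derivatives could look roughly
like this (a hedged sketch; the gradcheck flags and dtype are my assumptions,
not the actual OpInfo configuration):

```python
import torch
from torch.autograd import gradcheck

# Double precision (here complex128) is what gradcheck's finite differences need.
A = torch.randn(4, 4, dtype=torch.complex128, requires_grad=True)

def lu_factors(a):
    # P is a piecewise-constant permutation, so we only differentiate L and U.
    P, L, U = torch.linalg.lu(a)
    return L, U

# Checks the backward formula and, via check_forward_ad=True, the forward-mode one.
assert gradcheck(lu_factors, (A,), check_forward_ad=True)
```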

I really hope we don't continue adding more features.

This PR also avoids saving some of the tensors that were previously saved
unnecessarily for the backward in lu_factor_ex_backward and lu_backward, and
makes some other general improvements here and there to the forward and
backward AD formulae of other related functions.

cc @jianyuh @nikitaved @pearu @mruberry @walterddr @IvanYashchuk @xwang233 @lezcano

@pytorch-probot

pytorch-probot Bot commented Nov 4, 2021

CI Flow Status

⚛️ CI Flow

Ruleset - Version: v1
Ruleset - File: https://github.com/pytorch/pytorch/blob/40c765dc6d98e8078594e0c6eaf5b7f7b49d68af/.github/generated-ciflow-ruleset.json
PR ciflow labels: ciflow/default,ciflow/all

Triggered Workflows

| Workflow | Labels (bold enabled) | Status |
| --- | --- | --- |
| caffe2-linux-xenial-py3.7-gcc5.4 | ciflow/all, ciflow/cpu, ciflow/linux, ciflow/trunk | ✅ triggered |
| docker-builds | ciflow/all, ciflow/trunk | ✅ triggered |
| ios-12-5-1-arm64 | ciflow/all, ciflow/ios, ciflow/macos, ciflow/trunk | ✅ triggered |
| ios-12-5-1-arm64-coreml | ciflow/all, ciflow/ios, ciflow/macos, ciflow/trunk | ✅ triggered |
| ios-12-5-1-arm64-custom-ops | ciflow/all, ciflow/ios, ciflow/macos, ciflow/trunk | ✅ triggered |
| ios-12-5-1-arm64-full-jit | ciflow/all, ciflow/ios, ciflow/macos, ciflow/trunk | ✅ triggered |
| ios-12-5-1-arm64-metal | ciflow/all, ciflow/ios, ciflow/macos, ciflow/trunk | ✅ triggered |
| ios-12-5-1-x86-64 | ciflow/all, ciflow/ios, ciflow/macos, ciflow/trunk | ✅ triggered |
| ios-12-5-1-x86-64-coreml | ciflow/all, ciflow/ios, ciflow/macos, ciflow/trunk | ✅ triggered |
| ios-12-5-1-x86-64-full-jit | ciflow/all, ciflow/ios, ciflow/macos, ciflow/trunk | ✅ triggered |
| libtorch-linux-xenial-cuda10.2-py3.7-gcc7 | ciflow/all, ciflow/cuda, ciflow/libtorch, ciflow/linux, ciflow/trunk | ✅ triggered |
| libtorch-linux-xenial-cuda11.3-py3.7-gcc7 | ciflow/all, ciflow/cuda, ciflow/libtorch, ciflow/linux, ciflow/trunk | ✅ triggered |
| linux-bionic-cuda10.2-py3.9-gcc7 | ciflow/all, ciflow/cuda, ciflow/linux, ciflow/slow, ciflow/trunk | ✅ triggered |
| linux-bionic-py3.7-clang9 | ciflow/all, ciflow/cpu, ciflow/default, ciflow/linux, ciflow/noarch, ciflow/trunk | ✅ triggered |
| linux-docs | ciflow/all, ciflow/cpu, ciflow/default, ciflow/docs, ciflow/linux, ciflow/trunk | ✅ triggered |
| linux-docs-push | ciflow/all, ciflow/cpu, ciflow/linux, ciflow/scheduled | ✅ triggered |
| linux-vulkan-bionic-py3.7-clang9 | ciflow/all, ciflow/cpu, ciflow/default, ciflow/linux, ciflow/trunk, ciflow/vulkan | ✅ triggered |
| linux-xenial-cuda11.3-py3.7-gcc7 | ciflow/all, ciflow/cuda, ciflow/default, ciflow/linux, ciflow/trunk | ✅ triggered |
| linux-xenial-cuda11.3-py3.7-gcc7-bazel-test | ciflow/all, ciflow/bazel, ciflow/cpu, ciflow/default, ciflow/linux, ciflow/trunk | ✅ triggered |
| linux-xenial-cuda11.3-py3.7-gcc7-no-ops | ciflow/all, ciflow/cuda, ciflow/linux, ciflow/trunk | ✅ triggered |
| linux-xenial-py3-clang5-mobile-build | ciflow/all, ciflow/default, ciflow/linux, ciflow/mobile, ciflow/trunk | ✅ triggered |
| linux-xenial-py3-clang5-mobile-custom-build-static | ciflow/all, ciflow/default, ciflow/linux, ciflow/mobile, ciflow/trunk | ✅ triggered |
| linux-xenial-py3.7-clang7-asan | ciflow/all, ciflow/cpu, ciflow/default, ciflow/linux, ciflow/sanitizers, ciflow/trunk | ✅ triggered |
| linux-xenial-py3.7-clang7-onnx | ciflow/all, ciflow/cpu, ciflow/default, ciflow/linux, ciflow/onnx, ciflow/trunk | ✅ triggered |
| linux-xenial-py3.7-gcc5.4 | ciflow/all, ciflow/cpu, ciflow/default, ciflow/linux, ciflow/trunk | ✅ triggered |
| linux-xenial-py3.7-gcc7 | ciflow/all, ciflow/cpu, ciflow/default, ciflow/linux, ciflow/trunk | ✅ triggered |
| linux-xenial-py3.7-gcc7-no-ops | ciflow/all, ciflow/cpu, ciflow/default, ciflow/linux, ciflow/trunk | ✅ triggered |
| macos-10-15-py3-arm64 | ciflow/all, ciflow/macos, ciflow/trunk | ✅ triggered |
| macos-10-15-py3-lite-interpreter-x86-64 | ciflow/all, ciflow/macos, ciflow/trunk | ✅ triggered |
| macos-11-py3-x86-64 | ciflow/all, ciflow/macos, ciflow/trunk | ✅ triggered |
| parallelnative-linux-xenial-py3.7-gcc5.4 | ciflow/all, ciflow/cpu, ciflow/linux, ciflow/trunk | ✅ triggered |
| periodic-libtorch-linux-bionic-cuda11.5-py3.7-gcc7 | ciflow/all, ciflow/cuda, ciflow/libtorch, ciflow/linux, ciflow/scheduled | ✅ triggered |
| periodic-libtorch-linux-xenial-cuda11.1-py3.7-gcc7 | ciflow/all, ciflow/cuda, ciflow/libtorch, ciflow/linux, ciflow/scheduled | ✅ triggered |
| periodic-linux-bionic-cuda11.5-py3.7-gcc7 | ciflow/all, ciflow/cuda, ciflow/linux, ciflow/scheduled | ✅ triggered |
| periodic-linux-xenial-cuda10.2-py3-gcc7-slow-gradcheck | ciflow/all, ciflow/cuda, ciflow/linux, ciflow/scheduled, ciflow/slow, ciflow/slow-gradcheck | ✅ triggered |
| periodic-linux-xenial-cuda11.1-py3.7-gcc7-debug | ciflow/all, ciflow/cuda, ciflow/linux, ciflow/scheduled | ✅ triggered |
| periodic-win-vs2019-cuda11.1-py3 | ciflow/all, ciflow/cuda, ciflow/scheduled, ciflow/win | ✅ triggered |
| periodic-win-vs2019-cuda11.5-py3 | ciflow/all, ciflow/cuda, ciflow/scheduled, ciflow/win | ✅ triggered |
| pytorch-linux-xenial-py3-clang5-android-ndk-r19c-build | ciflow/all, ciflow/android, ciflow/cpu, ciflow/linux, ciflow/trunk | ✅ triggered |
| pytorch-linux-xenial-py3-clang5-android-ndk-r19c-gradle-custom-build-single | ciflow/all, ciflow/android, ciflow/cpu, ciflow/default, ciflow/linux, ciflow/trunk | ✅ triggered |
| pytorch-linux-xenial-py3-clang5-android-ndk-r19c-gradle-custom-build-single-full-jit | ciflow/all, ciflow/android, ciflow/cpu, ciflow/default, ciflow/linux, ciflow/trunk | ✅ triggered |
| win-vs2019-cpu-py3 | ciflow/all, ciflow/cpu, ciflow/default, ciflow/trunk, ciflow/win | ✅ triggered |
| win-vs2019-cuda11.3-py3 | ciflow/all, ciflow/cuda, ciflow/default, ciflow/trunk, ciflow/win | ✅ triggered |

Skipped Workflows

| Workflow | Labels | Status |
| --- | --- | --- |
| linux-binary-conda | ciflow/binaries, ciflow/binaries/conda | 🚫 skipped |
| linux-binary-libtorch-cxx11-abi | ciflow/binaries, ciflow/binaries/libtorch | 🚫 skipped |
| linux-binary-libtorch-pre-cxx11 | ciflow/binaries, ciflow/binaries/libtorch | 🚫 skipped |
| linux-binary-manywheel | ciflow/binaries, ciflow/binaries/wheel | 🚫 skipped |

You can add a comment to the PR and tag @pytorchbot with the following commands:
# ciflow rerun, "ciflow/default" will always be added automatically
@pytorchbot ciflow rerun

# ciflow rerun with additional labels "-l <ciflow/label_name>", which is equivalent to adding these labels manually and trigger the rerun
@pytorchbot ciflow rerun -l ciflow/scheduled -l ciflow/slow

For more information, please take a look at the CI Flow Wiki.

@facebook-github-bot
Contributor

facebook-github-bot commented Nov 4, 2021


💊 CI failures summary and remediations

As of commit 44868be (more details on the Dr. CI page):


🕵️ 1 new failure recognized by patterns

The following CI failures do not appear to be due to upstream breakages

See GitHub Actions build pull / linux-xenial-py3.7-clang7-onnx / test (default, 1, 2, linux.2xlarge) (1/1)

Step: "Test" (full log | diagnosis details | 🔁 rerun)

2022-05-04T15:12:27.6250001Z ##[error]Process completed with exit code 135.
2022-05-04T15:12:27.4411329Z + case "$test" in
2022-05-04T15:12:27.4413059Z ++ basename /opt/conda/lib/python3.7/site-packages/torch/test/backend_fallback_test
2022-05-04T15:12:27.4418589Z + LD_LIBRARY_PATH=/opt/conda/lib/python3.7/site-packages/torch/lib
2022-05-04T15:12:27.4419401Z + /opt/conda/lib/python3.7/site-packages/torch/test/backend_fallback_test --gtest_output=xml:/var/lib/jenkins/workspace/test/test-reports/cpp/backend_fallback_test.xml
2022-05-04T15:12:27.5980243Z Running main() from /var/lib/jenkins/workspace/third_party/googletest/googletest/src/gtest_main.cc
2022-05-04T15:12:27.5980903Z [==========] Running 3 tests from 1 test suite.
2022-05-04T15:12:27.5981206Z [----------] Global test environment set-up.
2022-05-04T15:12:27.5981505Z [----------] 3 tests from BackendFallbackTest
2022-05-04T15:12:27.5982065Z [ RUN      ] BackendFallbackTest.TestBackendFallbackWithMode
2022-05-04T15:12:27.6220080Z .jenkins/caffe2/test.sh: line 52:  1209 Bus error               (core dumped) LD_LIBRARY_PATH="$ld_library_path" "$test" --gtest_output=xml:"$gtest_reports_dir/$(basename $test).xml"
2022-05-04T15:12:27.6250001Z ##[error]Process completed with exit code 135.
2022-05-04T15:12:27.6287862Z ##[group]Run pytorch/pytorch/.github/actions/get-workflow-job-id@master
2022-05-04T15:12:27.6288106Z with:
2022-05-04T15:12:27.6288508Z   github-token: ***
2022-05-04T15:12:27.6288681Z env:
2022-05-04T15:12:27.6288928Z   IN_CI: 1
2022-05-04T15:12:27.6289092Z   IS_GHA: 1
2022-05-04T15:12:27.6289268Z   GIT_DEFAULT_BRANCH: master
2022-05-04T15:12:27.6289441Z ##[endgroup]
2022-05-04T15:12:27.6315586Z ##[group]Run nick-fields/retry@71062288b76e2b6214ebde0e673ce0de1755740a
2022-05-04T15:12:27.6315817Z with:

❄️ 1 failure tentatively classified as flaky

but reruns have not yet been triggered to confirm:

See GitHub Actions build trunk / linux-bionic-rocm5.1-py3.7-distributed / test (distributed, 2, 2, linux.rocm.gpu) (1/1)

Step: "Test" (full log | diagnosis details | 🔁 rerun) ❄️

2022-05-04T17:22:30.8835051Z RuntimeError: Proc...ated or timed out after 110.03800439834595 seconds
2022-05-04T17:22:30.8814767Z ======================================================================
2022-05-04T17:22:30.8815990Z ERROR [110.058s]: test_delayed_reduce_scatter_offload_false_prefetch_pre_shard_grad_op (__main__.TestParityWithDDP)
2022-05-04T17:22:30.8818368Z ----------------------------------------------------------------------
2022-05-04T17:22:30.8819453Z Traceback (most recent call last):
2022-05-04T17:22:30.8821492Z   File "/opt/conda/lib/python3.7/site-packages/torch/testing/_internal/common_distributed.py", line 494, in wrapper
2022-05-04T17:22:30.8822834Z     self._join_processes(fn)
2022-05-04T17:22:30.8824811Z   File "/opt/conda/lib/python3.7/site-packages/torch/testing/_internal/common_distributed.py", line 717, in _join_processes
2022-05-04T17:22:30.8826382Z     self._check_return_codes(elapsed_time)
2022-05-04T17:22:30.8831659Z   File "/opt/conda/lib/python3.7/site-packages/torch/testing/_internal/common_distributed.py", line 769, in _check_return_codes
2022-05-04T17:22:30.8833078Z     i, elapsed_time
2022-05-04T17:22:30.8835051Z RuntimeError: Process 0 terminated or timed out after 110.03800439834595 seconds
2022-05-04T17:22:30.8836095Z 
2022-05-04T17:22:30.8837367Z ----------------------------------------------------------------------
2022-05-04T17:22:30.8838778Z Ran 206 tests in 3296.336s
2022-05-04T17:22:30.8839280Z 
2022-05-04T17:22:30.8839776Z FAILED (errors=1, unexpected successes=3)
2022-05-04T17:22:30.8840431Z 
2022-05-04T17:22:30.8840858Z Generating XML reports...
2022-05-04T17:22:30.8930773Z Generated XML report: test-reports/python-unittest/distributed.fsdp.test_fsdp_core/TEST-TestHooks-20220504162734.xml
2022-05-04T17:22:30.8932914Z Generated XML report: test-reports/python-unittest/distributed.fsdp.test_fsdp_core/TEST-TestNoGrad-20220504162734.xml
2022-05-04T17:22:30.8936313Z Generated XML report: test-reports/python-unittest/distributed.fsdp.test_fsdp_core/TEST-TestParamInit-20220504162734.xml

This comment was automatically generated by Dr. CI.

Please report bugs/suggestions to the (internal) Dr. CI Users group.


lezcano added a commit that referenced this pull request Nov 4, 2021
@lezcano lezcano marked this pull request as draft November 4, 2021 09:00
@lezcano lezcano added the module: linear algebra label Nov 4, 2021
lezcano added a commit that referenced this pull request Nov 8, 2021
lezcano added 10 commits March 10, 2022 19:19
@lezcano
Collaborator Author

lezcano commented Apr 5, 2022

The reds in the CI are unrelated.

lezcano added 6 commits April 5, 2022 20:21
@lezcano
Collaborator Author

lezcano commented May 4, 2022

@pytorchmergebot please merge this

@lezcano
Collaborator Author

lezcano commented May 4, 2022

@pytorchmergebot merge this please

@pytorchmergebot
Collaborator

Merge failed due to Matched rule superuser, but it was not reviewed yet by any of: sshawnwu, terrychenism, wconstab, fbbradheintz, jnkwok1, ...
Raised by https://github.com/pytorch/pytorch/actions/runs/2269967856

@lezcano
Collaborator Author

lezcano commented May 4, 2022

@mruberry could I have this stamped please?

Collaborator

@mruberry mruberry left a comment


Cool! Stamped!

@lezcano
Collaborator Author

lezcano commented May 4, 2022

@pytorchmergebot merge this please

@pytorchmergebot
Collaborator

Merge failed due to Command git -C /home/runner/work/pytorch/pytorch cherry-pick -x 4b83526396541d06936d59765872466c27265a5f returned non-zero exit code 1

Auto-merging aten/src/ATen/native/native_functions.yaml
Auto-merging test/allowlist_for_publicAPI.json
Auto-merging test/test_linalg.py
Auto-merging tools/autograd/derivatives.yaml
Auto-merging tools/autograd/gen_variable_type.py
CONFLICT (content): Merge conflict in tools/autograd/gen_variable_type.py
Auto-merging torch/_torch_docs.py
Auto-merging torch/csrc/autograd/FunctionsManual.cpp
Auto-merging torch/overrides.py
Auto-merging torch/testing/_internal/common_methods_invocations.py
error: could not apply 4b83526396... Add linalg.lu
hint: After resolving the conflicts, mark them with
hint: "git add/rm <pathspec>", then run
hint: "git cherry-pick --continue".
hint: You can instead skip this commit with "git cherry-pick --skip".
hint: To abort and get back to the state before "git cherry-pick",
hint: run "git cherry-pick --abort".

Raised by https://github.com/pytorch/pytorch/actions/runs/2270253067

@lezcano
Collaborator Author

lezcano commented May 5, 2022

@pytorchmergebot merge this please

@github-actions
Contributor

github-actions Bot commented May 5, 2022

Hey @lezcano.
You've committed this PR, but it does not have both a 'release notes: ...' and 'topics: ...' label. Please add one of each to the PR. The 'release notes: ...' label should represent the part of PyTorch that this PR changes (fx, autograd, distributed, etc) and the 'topics: ...' label should represent the kind of PR it is (not user facing, new feature, bug fix, perf improvement, etc). The list of valid labels can be found here for the 'release notes: ...' and here for the 'topics: ...'.
For changes that are 'topic: not user facing' there is no need for a release notes label.

@malfet
Contributor

malfet commented May 6, 2022

This PR broke internal builds, because BatchLinearAlgebraKernel.cpp references the for_each method from TensorIterator.cpp, but for some reason they are part of different filelists, see:

"aten/src/ATen/TensorIterator.cpp",

but
"aten/src/ATen/native/BatchLinearAlgebraKernel.cpp",

@cccclai you've introduced the split, do you know why some native files are in one filelist, but others in another?


Labels

- cla signed
- module: linear algebra (Issues related to specialized linear algebra operations in PyTorch; includes matrix multiply matmul)
- module: structured kernels (Related to new structured kernels functionality)
- open source
- release notes: linalg_frontend (release notes category)
- topic: new features (topic category)

9 participants