Added CUDA support for complex input for torch.cholesky_solve #47047

IvanYashchuk wants to merge 21 commits into pytorch:master
Conversation
💊 Dr. CI: As of commit 09d7262, there are no CI failures.
Codecov Report

```
@@            Coverage Diff            @@
##           master   #47047    +/-   ##
=======================================
  Coverage   80.79%   80.79%
=======================================
  Files        1865     1865
  Lines      201074   201074
=======================================
+ Hits       162456   162459       +3
+ Misses      38618    38615       -3
```
```diff
 MAGMAQueue magma_queue(self.get_device());
-constexpr int64_t batch_limit = 262140;
+int64_t batch_limit = self.is_complex() ? 65535 : 262140;
```
Why do we have a different batch limit for complex and non-complex dtypes? Can you link me to where this is documented?
I don't know whether it's documented somewhere; I determined this value via experiments.
CUDA limits the y and z grid dimensions of a kernel launch to 65535. Maybe batching is implemented differently for non-complex dtypes, allowing 262140 batches.
Synced with @ngimel offline. We should check the MAGMA manual and better document this difference in `batch_limit`, since the original comments are uninformative.
It's not documented in MAGMA.
CUDA limits the y and z grid dimensions of a kernel launch configuration to 65535.
https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#features-and-technical-specifications
| Specification | Limit |
|---|---|
| Maximum x-dimension of a grid of thread blocks | 2^31 − 1 |
| Maximum y- or z-dimension of a grid of thread blocks | 65535 |
I haven't checked the source code for how batching is done for non-complex dtypes, but apparently the complex variants use the z-dimension of the grid of thread blocks for batching.
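The 65535 limit means a large batch has to be dispatched in chunks on the complex path. The actual dispatch happens in C++ against the MAGMA batched routines, but the chunking idea can be sketched in Python; the helper name below and the use of `torch.cholesky_solve` as the per-chunk backend are illustrative stand-ins, not the real MAGMA call:

```python
import torch

def batched_solve_in_chunks(b, L, upper=False, batch_limit=65535):
    """Illustrative sketch: split a large batch into chunks no larger than
    batch_limit before handing each chunk to the backend solver (here a
    hypothetical stand-in, torch.cholesky_solve itself)."""
    results = []
    for start in range(0, b.shape[0], batch_limit):
        chunk_b = b[start:start + batch_limit]
        chunk_L = L[start:start + batch_limit]
        results.append(torch.cholesky_solve(chunk_b, chunk_L, upper=upper))
    # Reassemble the per-chunk solutions along the batch dimension.
    return torch.cat(results, dim=0)
```

Each chunk stays within the z-grid-dimension limit, and the per-batch results are independent, so concatenating the chunks reproduces the full batched solve.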
I spent a little time looking at the complex path and didn't figure it out, but I did see this:
```c
if ( n > 2048 ) {
    #ifndef MAGMA_NOWARNING
    printf("=========================================================================================\n"
           " WARNING batched routines are designed for small sizes. It might be better to use the\n"
           " Native/Hybrid classical routines if you want good performance.\n"
           "=========================================================================================\n");
    #endif
}
```

in `magma_cpotrf_lg_batched`.
Yeah, we should use cuSOLVER for those, if we don't already.
cc @heitorschueroff, @xwang233: can you please create a tracking issue documenting which linalg functions use MAGMA, cuSOLVER, or cuBLAS under which conditions, and which functions still need to be weaned off MAGMA and switched to cuSOLVER?
Thanks, I'll create a tracking issue.
```python
A = root.tril()
return torch.cholesky_solve(b, A, upper)

gradcheck(func, [root, b, upper])
```
@IvanYashchuk please move the autograd tests to `common_methods_invocations.py`.
I think `common_methods_invocations.py` does not allow specifying the input function to be tested; it only allows specifying a postprocessing function.
Finite differencing doesn't work correctly for `torch.cholesky_solve` directly, therefore

```python
def func(A, b, upper):
    if upper:
        A = A.triu()
    else:
        A = A.tril()
    return torch.cholesky_solve(b, A, upper)
```

is tested instead.
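For reference, a minimal standalone version of this workaround might look as follows; the well-conditioned `root` construction and the complex128 dtype are illustrative choices, not taken from the PR's test code:

```python
import torch
from torch.autograd import gradcheck

def func(A, b, upper):
    # Re-apply the triangular mask inside the function so that finite
    # differences never perturb the entries cholesky_solve ignores.
    A = A.triu() if upper else A.tril()
    return torch.cholesky_solve(b, A, upper)

# A well-conditioned factor (boosted diagonal keeps the triangle invertible);
# double precision is needed for gradcheck's finite-difference comparisons.
root = (torch.randn(3, 3, dtype=torch.complex128) + 3 * torch.eye(3)).requires_grad_()
b = torch.randn(3, 1, dtype=torch.complex128, requires_grad=True)

gradcheck(func, [root, b, False])  # raises if analytic and numeric grads disagree
```

Masking inside `func` is what makes the check well-posed: perturbing the unused triangle of a raw `cholesky_solve` input changes nothing analytically, but the raw finite-difference probe would still touch those entries.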
I see. I also synced with @mruberry offline, and we came to the conclusion that it's ok to add autograd tests in `test_linalg.py`.
facebook-github-bot
left a comment
@anjali411 has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
@mruberry, I think we are ready to import this PR.
facebook-github-bot
left a comment
@mruberry has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
Sorry @IvanYashchuk, looks like this picked up a merge conflict. Would you rebase?

Done.
facebook-github-bot
left a comment
@mruberry has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
Added CUDA support for complex input for torch.cholesky_solve (pytorch#47047)

Summary: `torch.cholesky_solve` now works for complex inputs on GPU. I moved the existing tests to `test_linalg.py` and modified them to test complex and float32 dtypes. Differentiation also works correctly with complex inputs now.

Ref. pytorch#33152

Pull Request resolved: pytorch#47047
Reviewed By: ngimel
Differential Revision: D24730020
Pulled By: mruberry
fbshipit-source-id: 95402da5789c56e5a682019790985207fa28fa1f