Added Kronecker product of tensors (torch.kron) by IvanYashchuk · Pull Request #45358 · pytorch/pytorch

IvanYashchuk · 2020-09-25T20:53:05Z

This PR adds a function for calculating the Kronecker product of tensors.
The implementation is based on at::tensordot with permutations and reshape.
Tests pass.

TODO:

Add more test cases
Write documentation
Add entry common_methods_invokations.py

Ref. #42666

Tests pass. The implementation is based on tensordot.

dr-ci · 2020-09-25T22:24:44Z

💊 CI failures summary and remediations

As of commit a1b3255 (more details on the Dr. CI page):

💚 💚 Looks good so far! There are no failures yet. 💚 💚

This comment was automatically generated by Dr. CI (expand for details).

Follow this link to opt-out of these comments for your Pull Requests.

Please report bugs/suggestions on the GitHub issue tracker or post in the (internal) Dr. CI Users group.

See how this bot performed.

This comment has been revised 100 times.

codecov · 2020-09-29T19:33:04Z

Codecov Report

Merging #45358 into master will increase coverage by 0.00%.
The diff coverage is 100.00%.

@@           Coverage Diff           @@
##           master   #45358   +/-   ##
=======================================
  Coverage   60.81%   60.81%           
=======================================
  Files        2748     2748           
  Lines      254027   254070   +43     
=======================================
+ Hits       154488   154522   +34     
- Misses      99539    99548    +9

vishwakftw

Overall, looks good to me. Can we get some benchmarks too - comparison to np.linalg.kron?

IvanYashchuk · 2020-10-05T21:13:05Z

Here is the code for the benchmark.
The first column shows the input shapes.

Using pytorch 1.7.0a0+18e2767
Using cupy 8.0.0
Using numpy 1.19.2
[-------------------------------------------- kron torch.float64 -------------------------------------------]
                                  |  torch.kron CUDA  |  cupy.kron CUDA  |  torch.kron CPU  |  numpy.kron CPU
8 threads: --------------------------------------------------------------------------------------------------
      ((32,), (2, 16, 32))        |        54.5       |      224.8       |       47.4       |       128.2    
      ((32,), (32, 32))           |        39.1       |      169.7       |       44.8       |       114.8    
      ((32,), (32,))              |        25.3       |      123.9       |       15.7       |        43.6    
      ((32, 32), (2, 16, 32))     |       115.4       |      306.7       |      271.0       |      3968.2    
      ((32, 32), (32, 32))        |       113.6       |      260.8       |      207.0       |      3207.8    
      ((32, 32), (32,))           |        24.8       |      263.8       |       21.9       |       167.4    
      ((2, 16, 32), (2, 16, 32))  |       114.9       |      374.2       |      267.5       |      4214.7    
      ((2, 16, 32), (32, 32))     |       113.7       |      339.4       |      210.6       |      4026.5    
      ((2, 16, 32), (32,))        |        25.2       |      299.9       |       21.6       |       181.2 

[-------------------------------------------- kron torch.float32 -------------------------------------------]
                                  |  torch.kron CUDA  |  cupy.kron CUDA  |  torch.kron CPU  |  numpy.kron CPU
8 threads: --------------------------------------------------------------------------------------------------
      ((32,), (2, 16, 32))        |        52.3       |      234.5       |       37.9       |        90.9    
      ((32,), (32, 32))           |        38.0       |      173.7       |       36.5       |        80.0    
      ((32,), (32,))              |        23.2       |      126.0       |       14.6       |        41.3    
      ((32, 32), (2, 16, 32))     |        71.2       |      318.2       |      212.8       |      2125.3    
      ((32, 32), (32, 32))        |        59.7       |      259.0       |      116.1       |      1686.4    
      ((32, 32), (32,))           |        23.9       |      256.9       |       19.1       |       120.7    
      ((2, 16, 32), (2, 16, 32))  |        61.7       |      298.4       |      207.3       |      2127.6    
      ((2, 16, 32), (32, 32))     |        57.5       |      312.1       |      115.8       |      2107.0    
      ((2, 16, 32), (32,))        |        25.0       |      300.2       |       19.8       |       132.2    

Times are in microseconds (us).

vishwakftw · 2020-10-05T21:36:26Z

Any idea why some cases are not as fast?

IvanYashchuk · 2020-10-06T07:18:31Z

Well, the implementation is different. NumPy's implementation is based on outer (link to code), while here tensordot is used. torch.outer is for vectors only and probably would require more manipulations to get it right.

IvanYashchuk · 2020-10-20T08:08:29Z

Alright, I've realized that the previous timings were in debug mode 😄
I've updated the previous post. Now we see that the current implementation is faster than NumPy and CuPy.

…out kwarg

emmatyping · 2020-10-23T06:49:53Z

Hi, this is really exciting to see, I was hoping to use the kronecker product with complex tensors, but I couldn't discern if that would be supported by this. I look forward to using this!

IvanYashchuk · 2020-10-26T11:22:44Z

Updated _out implementation to use at::native::resize_output.
Explicit dispatch: Math is now used native_functions.yml.
Added non-contiguous test cases.
Documentation now includes the mathematical definition of the Kronecker product as on the Wikipedia page (tested locally that it renders correctly now).

mruberry · 2020-10-26T11:48:20Z

+
+Computes the Kronecker product, denoted by :math:`\otimes`, of :attr:`input` and :attr:`other`.
+
+If :attr:`input` is a :math:`(m \times n)` tensor and :attr:`other` is a


We didn't discuss this previously, but is the Kronecker product defined if either of A or B aren't matrices? Should we add a check for that?

I think the Kronecker product is defined mathematically only for matrices. We can think of vectors as m×1 matrix, and scalars as 1×1 matrix, then everything works.
Vectors are tested here (as (4, ) shape).
As for n-dimensional arrays with n>2, NumPy extends the definition as described in the notes section to "blocks of the second tensor scaled by the first tensor".
So kron does not support batching.

Sometimes it's said that for matrices Kronecker product == tensor outer product, but this is not true for tensors in general. For the example from Wiki about the tensor product, kron would give a tensor with dimensions (31, 510, 7*100).

Being like NumPy seems OK (our goal is to be compatible, after all). Would you add a note about how input and other are treated if they're not matrices? The current docs deal with them as if they must be matrices. Something like:

NOTE

The Kronecker product is typically defined only for two matrices

When either is a scalar or vector it's unsqueezed as...

When either is an input is a tensor with 3+ dimensions then...

ALTERNATIVELY you could expand the description of the function describe the matrix case, say that scalars and vectors are unsqueezed to be matrices, and THEN define the "general" case. That seems like a more challenging but better approach.

What are your thoughts, @IvanYashchuk?

What about in the main description present the general definition first and then mention what it does for matrices.

Computes the Kronecker product of input and other.
For general n-dimensional tensors this function computes:
... math expression

The number of dimensions of input and other is assumed to be the same and if necessary the smaller tensor is unsqueezed as the larger one.

If input is a (m \times n) ~~tensor~~ matrix and other is a (p \times q) ~~tensor~~ matrix, the result will be a (p*m \times q*n) block ~~tensor~~ matrix:
... kron definition for matrices from wiki
Scalar and vector inputs are unsqueezed to be matrices.

That approach sounds good. Instead of

"The number of dimensions of input and other is assumed to be the same and if necessary the smaller tensor is unsqueezed as the larger one."

I think it can say "If one tensor has fewer dimensions than the other it is unsqueezed until it has the same number of dimensions."

That change would make the explicit reference to scalar and vector inputs at the end redundant, so it can be removed.

The last caveat is that this needs to define the dot operator in both equations above.

Okay, dot was for the normal multiplication of scalars. Is asterisk (*) preferred?

Dot's OK if the documentation defines it (the dot operator is used for so many mathematical operations it's highly ambiguous), an asterisk would also be fine and probably doesn't require definition (it's typically used for elementwise multiplication and scalar multiplication).

I've updated the documentation so that the main text describes now the computation for general n-dimensional tensors. I left the matrix case description as a note section.
Here is the image of rendered docs:

mruberry

Awesome work, @IvanYashchuk. Made one last small docs comment for your review.

It'll be great to have torch.kron available. I was just talking to a PyTorch user the other day about how he's using matrices with a "Kronecker structure" as a form of structured sparsity.

Do me a favor, though, and when the updates are ready for review be sure to re-request review or say "This is ready for another review." With the number of PRs I'm tracking it's hard to understand when updates are being made vs. something is ready for review again.

Just ping me when you'd like this merged.

facebook-github-bot · 2020-10-30T17:25:18Z

Hi @IvanYashchuk!

Thank you for your pull request. We require contributors to sign our Contributor License Agreement, and yours needs attention.

You currently have a record in our system, but we do not have a signature on file.

In order for us to review and merge your code, please sign at https://code.facebook.com/cla. If you are contributing on behalf of someone else (eg your employer), the individual CLA may not be sufficient and your employer may need to sign the corporate CLA.

If you have received this in error or have any questions, please contact us at cla@fb.com. Thanks!

IvanYashchuk · 2020-11-02T10:08:00Z

Hi @mruberry, I think now this PR should be ready for merging.

facebook-github-bot

@mruberry has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

IvanYashchuk · 2020-11-02T11:22:37Z

Tests seem to fail. Is it related to common_methods_invocations.py's method_tests?

mruberry · 2020-11-02T11:29:34Z

Tests seem to fail. Is it related to common_methods_invocations.py's method_tests?

I would ignore the "Facebook internal" build signal. It's complete nonsense.

facebook-github-bot · 2020-11-03T21:15:59Z

@mruberry merged this pull request in f276ab5.

Summary: This PR adds a function for calculating the Kronecker product of tensors. The implementation is based on `at::tensordot` with permutations and reshape. Tests pass. TODO: - [x] Add more test cases - [x] Write documentation - [x] Add entry `common_methods_invokations.py` Ref. pytorch#42666 Pull Request resolved: pytorch#45358 Reviewed By: mrshenli Differential Revision: D24680755 Pulled By: mruberry fbshipit-source-id: b1f8694589349986c3abfda3dc1971584932b3fa

Added torch.kron

99a87b2

Tests pass. The implementation is based on tensordot.

pytorchbot added the open source label Sep 25, 2020

IvanYashchuk added 5 commits September 28, 2020 09:20

Updated tests

90f0159

Now kron permutation and reshape is correct

98a2f49

Rewrote using ternary

5a0dcf9

flake8

4c30a9a

Added documentation

3716b6a

IvanYashchuk marked this pull request as ready for review September 28, 2020 17:12

IvanYashchuk added 2 commits September 29, 2020 01:16

Merge remote-tracking branch 'upstream/master' into linalg-kron

8f3fb2a

Added fp32, complex dtypes

9931ad9

IvanYashchuk added the module: numpy Related to numpy support, and also numpy compatibility of our operators label Sep 29, 2020

Added overrides entry for torch.kron

18e2767

zhangguanheng66 added the triaged This issue has been looked at a team member, and triaged and prioritized into an appropriate module label Sep 29, 2020

zhangguanheng66 requested a review from vishwakftw September 29, 2020 20:12

IvanYashchuk mentioned this pull request Oct 5, 2020

torch.linalg in PyTorch 1.10 tracker #42666

Closed

4 tasks

vishwakftw reviewed Oct 5, 2020

View reviewed changes

IvanYashchuk added 5 commits October 20, 2020 08:21

Renamed a, b -> self, other

e66ab58

Added out= variant

dfb00d3

Updated documentation: added note on real and complex support, added …

f886d19

…out kwarg

Added empty and out= test cases

c60487b

Added entry to common_methods_invocations.py

76354a5

IvanYashchuk requested a review from mruberry October 20, 2020 09:19

IvanYashchuk added 2 commits October 20, 2020 09:23

Added test for error with incorrect out=

1f94849

Merge remote-tracking branch 'upstream/master' into linalg-kron

9bd2f3f

IvanYashchuk added 3 commits October 24, 2020 11:33

Merge remote-tracking branch 'upstream/master' into linalg-kron

b45d6c8

Use bmatrix instead of array in matrix example

e22a079

Fixed non-contiguous tests

fa90e71

IvanYashchuk requested a review from mruberry October 26, 2020 11:22

mruberry reviewed Oct 26, 2020

View reviewed changes

Comment thread aten/src/ATen/native/LinearAlgebra.cpp Outdated

mruberry reviewed Oct 26, 2020

View reviewed changes

IvanYashchuk added 4 commits October 26, 2020 10:14

Updated comment in LinearAlgebra.cpp

5ad2f45

Updated docs for general n-dim tensors

1cf37ff

Changed 'block tensor' -> 'tensor'

60db523

Fix too long line

5888b0e

mruberry reviewed Oct 30, 2020

View reviewed changes

Comment thread torch/_torch_docs.py Outdated

mruberry self-requested a review October 30, 2020 16:50

mruberry approved these changes Oct 30, 2020

View reviewed changes

facebook-github-bot added the cla signed label Oct 31, 2020

Changed the note according to Mike's suggestion

719a687

IvanYashchuk requested a review from mruberry November 2, 2020 10:08

Merge remote-tracking branch 'upstream/master' into linalg-kron

a1b3255

facebook-github-bot reviewed Nov 2, 2020

View reviewed changes

facebook-github-bot closed this in f276ab5 Nov 3, 2020

facebook-github-bot added the Merged label Nov 3, 2020

mruberry mentioned this pull request Nov 27, 2020

NumPy-like Functionality Request Rollup #38349

Closed

41 tasks


		Computes the Kronecker product, denoted by :math:`\otimes`, of :attr:`input` and :attr:`other`.

		If :attr:`input` is a :math:`(m \times n)` tensor and :attr:`other` is a

Conversation

IvanYashchuk commented Sep 25, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dr-ci Bot commented Sep 25, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

💊 CI failures summary and remediations

Uh oh!

codecov Bot commented Sep 29, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

vishwakftw left a comment

Choose a reason for hiding this comment

Uh oh!

IvanYashchuk commented Oct 5, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

vishwakftw commented Oct 5, 2020

Uh oh!

IvanYashchuk commented Oct 6, 2020

Uh oh!

IvanYashchuk commented Oct 20, 2020

Uh oh!

emmatyping commented Oct 23, 2020

Uh oh!

IvanYashchuk commented Oct 26, 2020

Uh oh!

Uh oh!

mruberry Oct 26, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

IvanYashchuk Oct 26, 2020

Choose a reason for hiding this comment

Uh oh!

mruberry Oct 26, 2020

Choose a reason for hiding this comment

Uh oh!

IvanYashchuk Oct 26, 2020

Choose a reason for hiding this comment

Uh oh!

mruberry Oct 26, 2020

Choose a reason for hiding this comment

Uh oh!

IvanYashchuk Oct 26, 2020

Choose a reason for hiding this comment

Uh oh!

mruberry Oct 26, 2020

Choose a reason for hiding this comment

Uh oh!

IvanYashchuk Oct 26, 2020

Choose a reason for hiding this comment

Uh oh!

Uh oh!

mruberry left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

facebook-github-bot commented Oct 30, 2020

Uh oh!

IvanYashchuk commented Nov 2, 2020

Uh oh!

facebook-github-bot left a comment

Choose a reason for hiding this comment

Uh oh!

IvanYashchuk commented Nov 2, 2020

Uh oh!

mruberry commented Nov 2, 2020

Uh oh!

facebook-github-bot commented Nov 3, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

IvanYashchuk commented Sep 25, 2020 •

edited

Loading

dr-ci Bot commented Sep 25, 2020 •

edited

Loading

codecov Bot commented Sep 29, 2020 •

edited

Loading

IvanYashchuk commented Oct 5, 2020 •

edited

Loading

mruberry Oct 26, 2020 •

edited

Loading

mruberry left a comment •

edited

Loading