Conversation
Could you fix the interface compatibility?
Is it possible to add the other return values with little or no overhead?
Does this version return the other values?
I added the other return values. The new overhead comes from calculating the sum of squares of the residuals (the squared L2 norm):

```python
if rank != n or m <= n:
    resids = cupy.array([], dtype=a.dtype)
elif b.ndim > 1:
    # note that this should be the same as (and faster than?)
    # the next couple of lines
    # e = b - core.dot(a, x)
    # resids = cupy.diagonal(core.dot(e.T, e))
    k = b.shape[1]
    resids = cupy.zeros(k, dtype=a.dtype)
    for i in range(k):
        e = b[:, i] - core.dot(a, x[:, i])
        resids[i] = core.dot(e.T, e)
else:
    e = b - core.dot(a, x)
    resids = core.dot(e.T, e).reshape(-1)
```
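The per-column loop above can be replaced by a single elementwise subtraction followed by a sum over rows. A minimal NumPy sketch of the equivalent computation (the matrices here are illustrative; a CuPy version would use the same expressions with cupy in place of np):

```python
import numpy as np

# Hypothetical small least-squares problem for illustration.
a = np.array([[1., 2.],
              [3., 4.],
              [5., 6.]])
b = np.array([[7., 8.],
              [9., 10.],
              [11., 13.]])

x, _, rank, s = np.linalg.lstsq(a, b, rcond=None)

# Loop version, as in the snippet above: one squared-norm per column.
k = b.shape[1]
resids_loop = np.zeros(k)
for i in range(k):
    e = b[:, i] - a.dot(x[:, i])
    resids_loop[i] = e.dot(e)

# Vectorized version: squared L2 norm of each residual column.
resids_vec = ((b - a.dot(x)) ** 2).sum(axis=0)

assert np.allclose(resids_loop, resids_vec)
```

The vectorized form also avoids materializing the full `e.T @ e` matrix that the commented-out `cupy.diagonal` variant would compute.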
```python
cupy.testing.assert_allclose(rank_cpu, rank_gpu, atol=1e-3)
cupy.testing.assert_allclose(s_cpu, s_gpu, atol=1e-3)
cupy.testing.assert_array_equal(a_gpu_copy, a_gpu)
cupy.testing.assert_array_equal(b_gpu_copy, b_gpu)
```
This implementation does not ensure that the result of cupy.lstsq is equal to that of numpy.lstsq, does it? In other words, it only ensures that the ||ax-b||^2 values are equal to each other?
If so, could you fix the test to check that ||ax-b||^2 is close?
I now check that ||ax-b||^2 is close with cupy.testing.assert_allclose(resids_cpu, resids_gpu, atol=1e-3).
I'm not sure about atol=1e-3; I picked it because the tests for cupy.linalg.pinv, which also uses SVD, use the same value.
This implementation does not ensure that the result (the first element of the return values) of cupy.lstsq is equal to that of numpy.lstsq, does it?
The implementation ensures that each element of the result is close, but not equal. They won't be exactly equal in practice, because the backends are different.
Edit: To clarify, the implementation checks that all elements of the NumPy results are close to the CuPy results.
Why is @condition.retry(10) at L157 needed? Is one of the result values often far from the expected value?
Long response #2165 (comment)
Changes:
1. Raise numpy.linalg.LinAlgError on invalid dimensions.
2. Remove the for loop in the calculation of residuals over the k dimension.
3. Clarify tests with comments and new test names.
4. Modify tests to expect numpy.linalg.LinAlgError.
Sometimes the tests fail. In particular, with

```python
import numpy as np

a = np.array([[2., 5., 1., 2.],
              [2., 2., 3., 8.],
              [4., 8., 1., 8.],
              [1., 3., 5., 1.]], dtype=np.float32)
b = np.array((9., 6., 0., 3.), dtype=np.float32)
```

solving the least squares problem as

```python
x, resid, rank, s = np.linalg.lstsq(a, b)
```

produces (x_cpu == numpy.linalg.lstsq solution, x_gpu == cupy.linalg.lstsq solution):

```
AssertionError:
Not equal to tolerance rtol=1e-07, atol=0.001

Mismatch: 50%
Max absolute difference: 0.0032959
Max relative difference: 1.6645674e-05
 x_cpu: array([198.      , -64.71429 ,   6.857143, -35.142857], dtype=float32)
 x_gpu: array([198.0033  , -64.71534 ,   6.857244, -35.14343 ], dtype=float32)
```

Should I make the tests deterministic to avoid such failures? This could be done by supplying a random seed for each configuration.
Yes, it seems preferable to make the tests deterministic.
@asi1024 Thanks for your recommendations.
I removed the retry condition, and the arrays are now generated with fixed random seeds.
The tests have also been reworked to handle the case where a singular matrix is generated: in that case they no longer check whether the first return element is close to the NumPy solution, while all other return elements are still checked.
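A rank-deficient (singular) input can be detected before comparing solutions, since in that case the least-squares solution is not unique and element-wise comparison against another backend is not meaningful. A minimal NumPy sketch (the matrix is illustrative; CuPy exposes a matrix_rank function with the same shape, assuming it is available):

```python
import numpy as np

# Hypothetical rank-deficient matrix: the second row is
# twice the first, so the matrix is singular.
a = np.array([[1., 2., 3.],
              [2., 4., 6.],
              [0., 1., 1.]], dtype=np.float32)

rank = np.linalg.matrix_rank(a)
singular = rank < min(a.shape)
# When singular, skip the element-wise check of x against the
# other backend; residuals, rank, and singular values can
# still be compared.
assert singular
```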
Jenkins, test this please.

Successfully created a job for commit cc5012c:

Jenkins CI test (for commit cc5012c, target branch master) succeeded!
LGTM. Thank you for the PR! |
Adds cupy.linalg.lstsq. This solves a least squares problem using SVD, similar to numpy.linalg.lstsq. It only returns the least-squares solution, while numpy.linalg.lstsq returns the residuals, rank, and singular values in addition to the least-squares solution. If it's necessary to have a 100% match with the return values of numpy.linalg.lstsq, I can modify this PR to include residuals, rank, and singular values.
Closes #1273
Edit 04-28-2019 22:54 EDT.
I have a benchmark comparing the performance of this cupy.linalg.lstsq vs numpy.linalg.lstsq, where CuPy was about six times faster on an AMD FX-8350 with an NVIDIA Titan Xp for 6,000,000 data points. I'd be curious whether this holds on other hardware configurations. The code to run the benchmark is here, and was run in the following order:
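A minimal timing harness in that spirit (a NumPy-only sketch with illustrative sizes, not the PR's benchmark script; a CuPy variant would swap np for cupy and synchronize the device before reading the clock):

```python
import time
import numpy as np

def bench_lstsq(m, n, repeats=3):
    # Illustrative problem size; the PR's benchmark used
    # around 6,000,000 data points.
    rng = np.random.RandomState(0)
    a = rng.rand(m, n).astype(np.float32)
    b = rng.rand(m).astype(np.float32)

    # Report the best of several runs to reduce timer noise.
    best = float("inf")
    for _ in range(repeats):
        start = time.perf_counter()
        np.linalg.lstsq(a, b, rcond=None)
        best = min(best, time.perf_counter() - start)
    return best

elapsed = bench_lstsq(10000, 4)
print("best of 3: %.6f s" % elapsed)
```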