
sparse gradcheck: reparametrize some tests to remove masked=True #98490

Closed
nikitaved wants to merge 51 commits into gh/nikitaved/35/base from gh/nikitaved/35/head

Conversation


@nikitaved nikitaved commented Apr 6, 2023

Most sparse functions that operate on sparse tensors assume that sparsity is an optimization, so a green check with masked=False implies success with masked=True. Functions that assume masked (sparse) semantics and do not explicitly ignore gradients outside of the sparsity pattern can be re-parametrized with torch.sparse_mask so that gradcheck succeeds with masked=False. Hence, masked=True can be removed altogether.

Stack from ghstack (oldest at bottom):

cc @alexsamardzic @pearu @cpuhrsch @amjames @bhosmer @ezyang @albanD @zou3519 @gqchen @soulitzer @lezcano @Varal7


pytorch-bot bot commented Apr 6, 2023

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/98490

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit e1bcfa2:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@pytorch-bot pytorch-bot bot added the `topic: not user facing` label Apr 6, 2023
nikitaved added a commit that referenced this pull request Apr 6, 2023
@nikitaved nikitaved marked this pull request as draft April 6, 2023 10:07
nikitaved added a commit that referenced this pull request Apr 11, 2023
nikitaved added a commit that referenced this pull request Apr 11, 2023
nikitaved added a commit that referenced this pull request Apr 12, 2023
nikitaved added a commit that referenced this pull request Apr 12, 2023

pearu commented Jun 19, 2023

Most of the sparse functions that work with sparse tensors assume that sparse is an optimization, so a green check with masked=False will imply success with masked=True.

Consider torch.mm, which supports sparse inputs and should use non-masked semantics. However, your statement does not hold in the following example case:

>>> a = torch.tensor([[0, 1], [2, 3]], dtype=torch.float64).to_sparse().requires_grad_(True)
>>> torch.autograd.gradcheck(lambda x: torch.mm(x, x).to_dense(masked_grad=False), (a,), masked=False)
True
>>> torch.autograd.gradcheck(lambda x: torch.mm(x, x).to_dense(masked_grad=True), (a,), masked=True)
<snip>
torch.autograd.gradcheck.GradcheckError: Jacobian mismatch for output 0 with respect to input 0,
numerical:tensor([[0.0000, 0.0000, 0.0000, 0.0000],
        [2.0000, 3.0000, 0.0000, 2.0000],
        [1.0000, 0.0000, 3.0000, 1.0000],
        [0.0000, 1.0000, 2.0000, 6.0000]], dtype=torch.float64)
analytical:tensor([[0., 1., 2., 0.],
        [2., 3., 0., 2.],
        [1., 0., 3., 1.],
        [0., 1., 2., 6.]], dtype=torch.float64)

The statement holds when the input sparse tensor is a full tensor:

>>> a = torch.tensor([[10, 1], [2, 3]], dtype=torch.float64).to_sparse().requires_grad_(True)
>>> torch.autograd.gradcheck(lambda x: torch.mm(x, x).to_dense(masked_grad=True), (a,), masked=True)
True
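The mismatch in the first example above can be reproduced without gradcheck or even torch: a minimal pure-Python finite-difference sketch (all helper names here are hypothetical, not PyTorch API) shows that the dense and masked Jacobians of f(X) = X @ X at X = [[0, 1], [2, 3]] differ exactly in the row for the unstored (0, 0) entry, matching the tensors printed by gradcheck.

```python
# Finite-difference Jacobian of f(X) = X @ X for 2x2 matrices.
# Rows index input entries, columns index output entries, as in gradcheck.
# Pure-Python illustration; helper names are hypothetical, not PyTorch API.

def matmul2(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)] for i in range(2)]

def flatten(m):
    return [m[i][j] for i in range(2) for j in range(2)]

def jacobian(x, pattern, eps=1e-6):
    """Central differences; entries outside `pattern` are never perturbed,
    which models the masked=True numerical Jacobian."""
    rows = []
    for i in range(2):
        for j in range(2):
            if (i, j) not in pattern:
                rows.append([0.0] * 4)  # unstored entry: no perturbation
                continue
            xp = [r[:] for r in x]; xm = [r[:] for r in x]
            xp[i][j] += eps; xm[i][j] -= eps
            fp, fm = flatten(matmul2(xp, xp)), flatten(matmul2(xm, xm))
            rows.append([(p - m) / (2 * eps) for p, m in zip(fp, fm)])
    return rows

x = [[0.0, 1.0], [2.0, 3.0]]
full = {(0, 0), (0, 1), (1, 0), (1, 1)}
sparse_pattern = {(0, 1), (1, 0), (1, 1)}   # the zero at (0, 0) is not stored

J_dense = jacobian(x, full)             # dense semantics: row 0 is [0, 1, 2, 0]
J_masked = jacobian(x, sparse_pattern)  # masked semantics: row 0 is all zeros
```

The two Jacobians agree on every row except the one for the unstored (0, 0) entry, which is exactly the first-row mismatch gradcheck reports.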

Next, consider torch.sparse.mm that implements mm with masked semantics:

>>> a = torch.tensor([[0, 1], [2, 3]], dtype=torch.float64).to_sparse().requires_grad_(True)
>>> torch.autograd.gradcheck(lambda x: torch.sparse.mm(x, x).to_dense(masked_grad=True), (a,), masked=True)
True

but with masked=False, the gradcheck will fail:

>>> torch.autograd.gradcheck(lambda x: torch.sparse.mm(x, x).to_dense(masked_grad=False), (a,), masked=False)
<snip>
torch.autograd.gradcheck.GradcheckError: Jacobian mismatch for output 0 with respect to input 0,
numerical:tensor([[0.0000, 1.0000, 2.0000, 0.0000],
        [2.0000, 3.0000, 0.0000, 2.0000],
        [1.0000, 0.0000, 3.0000, 1.0000],
        [0.0000, 1.0000, 2.0000, 6.0000]], dtype=torch.float64)
analytical:tensor([[0., 0., 0., 0.],
        [2., 3., 0., 2.],
        [1., 0., 3., 1.],
        [0., 1., 2., 6.]], dtype=torch.float64)

unless the input sparse tensor is full:

>>> a = torch.tensor([[10, 1], [2, 3]], dtype=torch.float64).to_sparse().requires_grad_(True)
>>> torch.autograd.gradcheck(lambda x: torch.sparse.mm(x, x).to_dense(masked_grad=False), (a,), masked=False)
True

Based on the above, I cannot confirm that masked=True can be removed.


nikitaved commented Jun 19, 2023

@pearu, it can be, once the gradients are properly mapped to the manifold with sparse_mask. torch.sparse.mm is just a composition of torch.mm and sparse_mask. More specifically, torch.sparse.mm should be equivalent to lambda x, y: torch.mm(x.sparse_mask(x), y.sparse_mask(y)). Sometimes we also want to restrict the in-flowing gradients, so another option is to additionally project the output: compute res = torch.mm(x.sparse_mask(x), y.sparse_mask(y)) and return res.sparse_mask(res). The combination of sparse_mask and mm gives us much more than just torch.sparse.mm.

Since masked=False performs a densification of sparse inputs, we can still test this function with

x_mask = x.detach().clone()
y_mask = y.detach().clone()

def mm(x, y):
    x = x.sparse_mask(x_mask) # project x onto x_mask, the in-flowing grad will have the same indices as x_mask
    y = y.sparse_mask(y_mask) # project y onto y_mask, the in-flowing grad will have the same indices as y_mask
    res = torch.mm(x, y)
    # make sure that in-flowing grads have the same indices as res
    return res.sparse_mask(res)

gradcheck(lambda x, y: mm(x, y).to_dense(masked_grad=False), (x, y), masked=False)
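The projection idea can also be checked numerically without torch. In the pure-Python sketch below (hypothetical helpers, not PyTorch API), composing the dense product with a projection onto the sparsity pattern makes the unrestricted dense Jacobian coincide with the masked one, which is what lets gradcheck(..., masked=False) cover the masked case.

```python
# Pure-Python sketch of the reparametrization: project(x, pattern) plays the
# role of x.sparse_mask(x_mask) for 2x2 matrices. Not PyTorch API.

def matmul2(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)] for i in range(2)]

def project(x, pattern):
    # zero out entries outside the pattern, the analogue of sparse_mask
    return [[x[i][j] if (i, j) in pattern else 0.0 for j in range(2)] for i in range(2)]

def dense_jacobian(f, x, eps=1e-6):
    # unrestricted central differences over ALL entries, i.e. masked=False
    rows = []
    for i in range(2):
        for j in range(2):
            xp = [r[:] for r in x]; xm = [r[:] for r in x]
            xp[i][j] += eps; xm[i][j] -= eps
            fp = [v for row in f(xp) for v in row]
            fm = [v for row in f(xm) for v in row]
            rows.append([(p - m) / (2 * eps) for p, m in zip(fp, fm)])
    return rows

pattern = {(0, 1), (1, 0), (1, 1)}          # the zero at (0, 0) is unstored
x = [[0.0, 1.0], [2.0, 3.0]]

def f_plain(t):
    return matmul2(t, t)

def f_reparam(t):
    tp = project(t, pattern)
    return matmul2(tp, tp)

J_plain = dense_jacobian(f_plain, x)
J_reparam = dense_jacobian(f_reparam, x)
# The row for the unstored (0, 0) entry vanishes in J_reparam, so the dense
# (masked=False) Jacobian of f_reparam equals the masked Jacobian of f_plain.
```

This mirrors the mm wrapper above: a dense gradcheck of the reparametrized function exercises the same gradients that masked=True would have checked on the original.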


pearu commented Jun 22, 2023

it can be once the gradients are properly mapped to the manifold with sparse_mask.
sparse.mm is just a composition of torch.mm and sparse_mask

I agree. This is what masked tensor support should handle: torch.mm on masked tensors is equivalent to the combination of torch.mm and torch.sparse_mask on value-mask pairs, as you exemplified.

torch.sparse.mm on sparse tensors is equivalent to torch.mm on masked tensors where the masks are defined by the sparsity patterns of the inputs. Recall that defining the mask via the sparsity pattern is something we want to get rid of eventually.

Our aim is to deprecate/eliminate torch.sparse.mm (together with the corresponding backward function) in favor of supporting masked tensors in torch.mm. Once we have this support and have applied the necessary deprecation procedures to handle BC-breaking changes (e.g. defining torch.sparse.mm in terms of torch.mm and converting sparse tensors to masked tensors), the masked keyword argument can safely be removed from gradcheck.

Until then, removing the usage of the masked keyword argument, as this PR does, is premature, IMHO. Especially so when this is achieved by removing the tests that exercise the masked=True case: by removing such tests, we risk that future changes to gradcheck introduce undetectable bugs in the gradcheck(..., masked=True) support, which we are not ready to drop yet.

In general, we should deprecate a feature before removing the corresponding tests. In this PR, tests are removed but deprecating the feature is not immediately possible because masked tensors support is not ready yet.


nikitaved commented Jun 23, 2023

I am in no rush to merge. This PR serves as a proof of concept: gradcheck does not need masked=True, as it complicates things, is error-prone when it comes to implementing backward formulas, and gives a false sense of security that the gradients are correct when it returns True.

nikitaved added 13 commits June 28, 2023 12:46
@pytorch pytorch deleted a comment from pytorch-bot bot Jul 3, 2023

github-actions bot commented Sep 1, 2023

Looks like this PR hasn't been updated in a while so we're going to go ahead and mark this as Stale.
Feel free to remove the Stale label if you feel this was a mistake.
If you are unable to remove the Stale label please contact a maintainer in order to do so.
If you want the bot to never mark this PR stale again, add the no-stale label.
Stale pull requests will automatically be closed after 30 days of inactivity.

@github-actions github-actions bot added the Stale label Sep 1, 2023
@github-actions github-actions bot closed this Oct 1, 2023
@facebook-github-bot facebook-github-bot deleted the gh/nikitaved/35/head branch November 1, 2023 14:25

Labels

ciflow/trunk, module: autograd, module: sparse, open source, Stale, topic: not user facing
