sparse_mask: backward support for sparse lhs (take 2) by nikitaved · Pull Request #104341 · pytorch/pytorch

nikitaved · 2023-06-28T10:27:34Z

This is a copy of #95165 with some bug fixes.

Stack from ghstack (oldest at bottom):

cc @alexsamardzic @pearu @cpuhrsch @amjames @bhosmer @ezyang @albanD @zou3519 @gqchen @soulitzer @lezcano @Varal7

[ghstack-poisoned]

pytorch-bot · 2023-06-28T10:27:37Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/104341

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit f27da78:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

nikitaved · 2023-06-28T10:30:02Z

Sorry to bother you again with this one, @pearu, but could you please approve this one again?

nikitaved · 2023-06-28T11:19:32Z

@albanD, @soulitzer, could you please also approve it for good measure, since @cpuhrsch is on PTO?

albanD

Looks good!

test/test_sparse.py

pearu

LGTM! Thanks, @nikitaved!

amjames

Thanks!

nikitaved · 2023-06-29T12:00:29Z

Alright, this looks weird. The slow tests are failing in a somewhat non-deterministic manner across shards. Is there a way to trigger these failures locally? I have no issues when running gradcheck with fast_mode=False.

pearu · 2023-06-29T14:03:04Z

> Alright, this looks weird. The slow tests are failing in a somewhat non-deterministic manner across shards. Is there a way to trigger these failures locally? I have no issues when running gradcheck with fast_mode=False.

Yes, try setting the environment variable PYTORCH_TEST_WITH_SLOW_GRADCHECK=1 prior to running the gradcheck tests.
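
For anyone reproducing this locally, here is a minimal sketch of one way to launch the tests with that variable set. The helper names, the default test file (taken from the file touched in this PR), and the `-k sparse_mask` filter are illustrative assumptions, not part of the PyTorch test tooling.

```python
import os
import subprocess
import sys

# Variable name as given in the comment above.
SLOW_GRADCHECK_VAR = "PYTORCH_TEST_WITH_SLOW_GRADCHECK"


def slow_gradcheck_env(base_env=None):
    """Return a copy of the environment with slow gradcheck enabled."""
    env = dict(os.environ if base_env is None else base_env)
    env[SLOW_GRADCHECK_VAR] = "1"
    return env


def slow_gradcheck_command(test_file="test/test_sparse.py", pattern="sparse_mask"):
    # Hypothetical defaults: the file path appears in this PR, while the
    # "-k" pattern is an assumption about which tests are of interest.
    return [sys.executable, test_file, "-k", pattern]


def run_slow_gradcheck(**kwargs):
    # Pass the modified environment to the child process so the variable
    # is already set when the test module starts.
    return subprocess.run(slow_gradcheck_command(**kwargs), env=slow_gradcheck_env())
```

Calling `run_slow_gradcheck()` from the root of a PyTorch checkout should be roughly equivalent to running `PYTORCH_TEST_WITH_SLOW_GRADCHECK=1 python test/test_sparse.py -k sparse_mask` in a shell.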

nikitaved · 2023-06-29T14:48:51Z

Thank you, @pearu! Now it fails locally, but only when masked=True. It seems I know why...

nikitaved · 2023-06-29T15:01:47Z

Nice, problem solved! One more point for masked=True being very confusing ;)

nikitaved · 2023-06-30T13:37:12Z

The failures are unrelated :) I think it is good to go!

amjames · 2023-06-30T14:08:07Z

@pytorchbot merge

pytorchmergebot · 2023-06-30T14:11:12Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status here

pytorchmergebot · 2023-06-30T14:11:15Z

Merge failed

Reason: 1 jobs have failed, first few of them are: slow / linux-bionic-cuda12.1-py3-gcc9-slow-gradcheck / test (default, 1, 4, linux.g5.4xlarge.nvidia.gpu)

nikitaved · 2023-06-30T14:11:38Z

@amjames, I will rebase, then we can merge :)

nikitaved · 2023-07-03T14:10:11Z

@pytorchbot merge

pytorchmergebot · 2023-07-03T14:12:39Z

Merge started

sparse_mask: backward support for sparse lhs (take 2)

24cb686

[ghstack-poisoned]

nikitaved requested review from albanD and soulitzer as code owners June 28, 2023 10:27

pytorch-bot bot added the release notes: sparse label Jun 28, 2023

This was referenced Jun 23, 2023

sparse gradcheck: reparametrize some tests to remove masked=True #98490

Closed

sparse gradcheck: retire gradcheck_semantics #99043

Closed

sparse.sum backward: fix gradient projection #99298

Closed

nikitaved requested a review from pearu June 28, 2023 10:29

nikitaved added the module: sparse, module: autograd, ciflow/trunk, and ciflow/slow labels Jun 28, 2023

pytorchbot added the open source label Jun 28, 2023

nikitaved added the topic: new features label Jun 28, 2023

nikitaved added 3 commits June 28, 2023 12:46

Update on "sparse_mask: backward support for sparse lhs (take 2)"

d97c6ea

Update on "sparse_mask: backward support for sparse lhs (take 2)"

a40acbc

Update on "sparse_mask: backward support for sparse lhs (take 2)"

d82a1af

albanD approved these changes Jun 28, 2023

amjames reviewed Jun 28, 2023

test/test_sparse.py (outdated, resolved)

pearu reviewed Jun 28, 2023

test/test_sparse.py (outdated, resolved)

Update on "sparse_mask: backward support for sparse lhs (take 2)"

b90c8af

pearu approved these changes Jun 28, 2023

amjames approved these changes Jun 28, 2023

Update on "sparse_mask: backward support for sparse lhs (take 2)"

923fe66

Update on "sparse_mask: backward support for sparse lhs (take 2)"

7e61080

amjames mentioned this pull request Jun 29, 2023

sparse_mask: backward support for sparse lhs #95165

Closed

Update on "sparse_mask: backward support for sparse lhs (take 2)"

1998e5e

nikitaved added 4 commits June 29, 2023 15:03

Update on "sparse_mask: backward support for sparse lhs (take 2)"

c100711

Update on "sparse_mask: backward support for sparse lhs (take 2)"

b50c0c3

Update on "sparse_mask: backward support for sparse lhs (take 2)"

38020ad

Update on "sparse_mask: backward support for sparse lhs (take 2)"

e51deb0

pytorchmergebot added the merging label Jun 30, 2023

pytorchmergebot removed the merging label Jun 30, 2023

nikitaved added 2 commits June 30, 2023 14:12

Update on "sparse_mask: backward support for sparse lhs (take 2)"

9c5f7c3

Update on "sparse_mask: backward support for sparse lhs (take 2)"

f27da78

pytorchmergebot added the merging label Jul 3, 2023

pytorchmergebot added Merged and removed merging labels Jul 3, 2023

pytorchmergebot closed this in 437bc5b Jul 3, 2023

facebook-github-bot deleted the gh/nikitaved/59/head branch July 6, 2023 14:16


6 participants
