sparse_mask: backward support for sparse lhs (take 2)#104341
sparse_mask: backward support for sparse lhs (take 2)#104341nikitaved wants to merge 14 commits intogh/nikitaved/59/basefrom
Conversation
[ghstack-poisoned]
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/104341
Note: Links to docs will display an error until the docs builds have been completed. ✅ No FailuresAs of commit f27da78: This comment was automatically generated by Dr. CI and updates every 15 minutes. |
|
Sorry to bother you again with this one, @pearu, but could you please approve this one again? |
|
@albanD , @soulitzer , could you please also approve it for good measure since @cpuhrsch is on PTO? |
This is a copy of #95165 with some bug fixes. cc alexsamardzic pearu cpuhrsch amjames bhosmer ezyang albanD zou3519 gqchen soulitzer Lezcano Varal7 [ghstack-poisoned]
This is a copy of #95165 with some bug fixes. cc alexsamardzic pearu cpuhrsch amjames bhosmer ezyang albanD zou3519 gqchen soulitzer Lezcano Varal7 [ghstack-poisoned]
This is a copy of #95165 with some bug fixes. cc alexsamardzic pearu cpuhrsch amjames bhosmer ezyang albanD zou3519 gqchen soulitzer Lezcano Varal7 [ghstack-poisoned]
This is a copy of #95165 with some bug fixes. cc alexsamardzic pearu cpuhrsch amjames bhosmer ezyang albanD zou3519 gqchen soulitzer Lezcano Varal7 [ghstack-poisoned]
pearu
left a comment
There was a problem hiding this comment.
LGTM! Thanks, @nikitaved!
This is a copy of #95165 with some bug fixes. cc alexsamardzic pearu cpuhrsch amjames bhosmer ezyang albanD zou3519 gqchen soulitzer Lezcano Varal7 [ghstack-poisoned]
|
Alright, it looks weird. Slowtests are failing in somewhat non-deterministic manner across shards. Is there a way to trigger these things locally? I have no issues when running gradcheck with |
This is a copy of #95165 with some bug fixes. cc alexsamardzic pearu cpuhrsch amjames bhosmer ezyang albanD zou3519 gqchen soulitzer Lezcano Varal7 [ghstack-poisoned]
Yes, try defining env variable |
|
Thank you, @pearu! Now it fails locally, but only when |
This is a copy of #95165 with some bug fixes. cc alexsamardzic pearu cpuhrsch amjames bhosmer ezyang albanD zou3519 gqchen soulitzer Lezcano Varal7 [ghstack-poisoned]
|
Nice, problem solved! One more for |
This is a copy of #95165 with some bug fixes. cc alexsamardzic pearu cpuhrsch amjames bhosmer ezyang albanD zou3519 gqchen soulitzer Lezcano Varal7 [ghstack-poisoned]
This is a copy of #95165 with some bug fixes. cc alexsamardzic pearu cpuhrsch amjames bhosmer ezyang albanD zou3519 gqchen soulitzer Lezcano Varal7 [ghstack-poisoned]
This is a copy of #95165 with some bug fixes. cc alexsamardzic pearu cpuhrsch amjames bhosmer ezyang albanD zou3519 gqchen soulitzer Lezcano Varal7 [ghstack-poisoned]
This is a copy of #95165 with some bug fixes. cc alexsamardzic pearu cpuhrsch amjames bhosmer ezyang albanD zou3519 gqchen soulitzer Lezcano Varal7 [ghstack-poisoned]
|
The failures are unrelated :) I think it is good to go! |
|
@pytorchbot merge |
Merge startedYour change will be merged once all checks pass (ETA 0-4 Hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team |
Merge failedReason: 1 jobs have failed, first few of them are: slow / linux-bionic-cuda12.1-py3-gcc9-slow-gradcheck / test (default, 1, 4, linux.g5.4xlarge.nvidia.gpu) Details for Dev Infra teamRaised by workflow job |
|
@amjames , I will rebase, then we can merge :) |
This is a copy of #95165 with some bug fixes. cc alexsamardzic pearu cpuhrsch amjames bhosmer ezyang albanD zou3519 gqchen soulitzer Lezcano Varal7 [ghstack-poisoned]
This is a copy of #95165 with some bug fixes. cc alexsamardzic pearu cpuhrsch amjames bhosmer ezyang albanD zou3519 gqchen soulitzer Lezcano Varal7 [ghstack-poisoned]
|
@pytorchbot merge |
Merge startedYour change will be merged once all checks pass (ETA 0-4 Hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team |
This is a copy of #95165 with some bug fixes.
Stack from ghstack (oldest at bottom):
cc @alexsamardzic @pearu @cpuhrsch @amjames @bhosmer @ezyang @albanD @zou3519 @gqchen @soulitzer @lezcano @Varal7