[JIT] nvfuser CI fixes #75116
* test_native_batch_norm_backward
* test_reduction_empty_axes
* test_register_fuser
* test_category_rule

[ghstack-poisoned]
💊 CI failures summary and remediations
As of commit 98f2f4b (more details on the Dr. CI page):
🕵️ 1 new failure recognized by patterns
The following CI failures do not appear to be due to upstream breakages
jjsjann123 left a comment
LGTM. Unfortunately, the reduction issues are not reproducing on my local Pascal card; maybe I got lucky with the heuristics.
    torch._C._jit_nvfuser_clear_comparison_callback()


class TestPassManagerCudaFuser(JitTestCase):
    def setUp(self):
Looks like some other test leaked the flag and accidentally left nvfuser enabled. Or is it a race, where other tests running in parallel are flipping the nvfuser switch?
Another test leaked the flag (my fault, fixed it on line 159).
@jjsjann123 I assumed it was safe to skip, since the other reduction tests are marked the same way. Let me know if that's not accurate.
No, it looks fine to me as well. I need to double-check what went wrong with the repro on my local machine... This PR looks good to merge.
@pytorchmergebot merge this please
Hey @davidberard98.
Summary:
* test_native_batch_norm_backward
* test_reduction_empty_axes
* test_register_fuser
* test_category_rule

Pull Request resolved: #75116
Approved by: https://github.com/jjsjann123, https://github.com/eellison

Test Plan: contbuild & OSS CI, see https://hud.pytorch.org/commit/pytorch/pytorch/83400e836ebbb43d0b5b3c8b881288ed50bf4803

Reviewed By: b0noI
Differential Revision: D35404315
Pulled By: davidberard98
fbshipit-source-id: cd467428a4940f58af78705640443ad7d280a22e
Just for the record, I've been really dumb and somehow messed up the device number on my machine. Unsurprisingly, I've been running all my tests on a Volta card instead of the Pascal card. No wonder it does not repro. 😮‍💨
Stack from ghstack:
-> [JIT] nvfuser CI fixes #75116
test_native_batch_norm_backward
test_reduction_empty_axes
test_register_fuser
test_category_rule