As discussed in #43257: while working on #43092, I observed that after I changed some unrelated things, `test_reduction_split` started failing with OOM. The failure only happens when this test runs together with other tests; when run standalone, it passes. This suggests that the test suite may be holding CUDA memory between tests, which it probably shouldn't do.
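One way to check this hypothesis is to compare allocated CUDA memory before and after each test. The following is only a rough sketch, not part of the PyTorch test suite: `cuda_allocated_bytes` and `find_leaky_tests` are hypothetical helper names, and the sketch assumes that `torch.cuda.memory_allocated()` is a good enough proxy for memory held between tests (it does not count memory that the caching allocator has reserved but not allocated).

```python
import gc


def cuda_allocated_bytes():
    """Bytes currently allocated by PyTorch on the current CUDA device.

    Returns 0 when torch or CUDA is unavailable, so the sketch still runs on CPU.
    """
    try:
        import torch
    except ImportError:
        return 0
    if not torch.cuda.is_available():
        return 0
    return torch.cuda.memory_allocated()


def find_leaky_tests(tests, get_allocated=cuda_allocated_bytes):
    """Run each (name, callable) pair and record memory still held afterwards.

    `tests` and `get_allocated` are hypothetical parameters for illustration;
    the real test suite is organized differently.
    """
    leaks = {}
    for name, fn in tests:
        gc.collect()                 # drop unreachable tensors before measuring
        before = get_allocated()
        fn()
        gc.collect()                 # give the test's garbage a chance to be freed
        after = get_allocated()
        if after > before:
            leaks[name] = after - before
    return leaks
```

Any test that shows up in the returned dictionary kept memory alive after it finished, which would make it a candidate for causing the OOM seen in later tests.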
cc @mruberry @VitalyFedyunin