You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Add NCCL comm suspend, resume and memory stats (#176300)
Added three new APIs:
`backend.suspend()`: free the memory held by the backend/communicator
`backend.resume()`: restore the memory needed by the backend/communicator
`backend.memory_stats()`: return memory usage info of the backend.
```
pytest -vs test/distributed/test_c10d_nccl.py -k test_get_memory_stats
pytest -vs test/distributed/test_c10d_nccl.py -k test_suspend
pytest -vs test/distributed/test_c10d_nccl.py -k test_resume
```
Pull Request resolved: #176300
Approved by: https://github.com/Skylion007
Co-authored-by: Aaron Gokaslan <aaronGokaslan@gmail.com>
0 commit comments