[c10] Move P2P access logic from ATen to c10#173571
[c10] Move P2P access logic from ATen to c10#173571minsii wants to merge 1 commit intopytorch:mainfrom
Conversation
This PR needs a
|
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/173571
Note: Links to docs will display an error until the docs builds have been completed. ✅ No FailuresAs of commit f342c43 with merge base 02a87d7 ( This comment was automatically generated by Dr. CI and updates every 15 minutes. |
8aba31c to
3495f7e
Compare
Summary: Refactor PeerToPeerAccess by moving the core implementation from aten/src/ATen/cuda to c10/cuda. This makes P2P and fabric access queries available at the c10 layer without requiring ATen dependencies. The ATen layer now provides thin wrappers that ensure CUDA lazy initialization before forwarding to c10. This separation allows lower-level CUDA code to query P2P capabilities without pulling in ATen context machinery. Reviewed By: ngimel Differential Revision: D91506414
Summary: Refactor PeerToPeerAccess by moving the core implementation from aten/src/ATen/cuda to c10/cuda. This makes P2P and fabric access queries available at the c10 layer without requiring ATen dependencies. The ATen layer now provides thin wrappers that ensure CUDA lazy initialization before forwarding to c10. This separation allows lower-level CUDA code to query P2P capabilities without pulling in ATen context machinery. Reviewed By: ngimel Differential Revision: D91506414
Summary: Refactor PeerToPeerAccess by moving the core implementation from aten/src/ATen/cuda to c10/cuda. This makes P2P and fabric access queries available at the c10 layer without requiring ATen dependencies. The ATen layer now provides thin wrappers that ensure CUDA lazy initialization before forwarding to c10. This separation allows lower-level CUDA code to query P2P capabilities without pulling in ATen context machinery. Reviewed By: ngimel Differential Revision: D91506414
Summary: Refactor PeerToPeerAccess by moving the core implementation from aten/src/ATen/cuda to c10/cuda. This makes P2P and fabric access queries available at the c10 layer without requiring ATen dependencies. The ATen layer now provides thin wrappers that ensure CUDA lazy initialization before forwarding to c10. This separation allows lower-level CUDA code to query P2P capabilities without pulling in ATen context machinery. Reviewed By: ngimel Differential Revision: D91506414
3495f7e to
3c46061
Compare
Summary: Refactor PeerToPeerAccess by moving the core implementation from aten/src/ATen/cuda to c10/cuda. This makes P2P and fabric access queries available at the c10 layer without requiring ATen dependencies. The ATen layer now provides thin wrappers that ensure CUDA lazy initialization before forwarding to c10. This separation allows lower-level CUDA code to query P2P capabilities without pulling in ATen context machinery. Reviewed By: ngimel Differential Revision: D91506414
3c46061 to
1ef4deb
Compare
Summary: Refactor PeerToPeerAccess by moving the core implementation from aten/src/ATen/cuda to c10/cuda. This makes P2P and fabric access queries available at the c10 layer without requiring ATen dependencies. The ATen layer now provides thin wrappers that ensure CUDA lazy initialization before forwarding to c10. This separation allows lower-level CUDA code to query P2P capabilities without pulling in ATen context machinery. Reviewed By: ngimel Differential Revision: D91506414
Summary: Refactor PeerToPeerAccess by moving the core implementation from aten/src/ATen/cuda to c10/cuda. This makes P2P and fabric access queries available at the c10 layer without requiring ATen dependencies. The ATen layer now provides thin wrappers that ensure CUDA lazy initialization before forwarding to c10. This separation allows lower-level CUDA code to query P2P capabilities without pulling in ATen context machinery. Reviewed By: ngimel Differential Revision: D91506414
Summary: Refactor PeerToPeerAccess by moving the core implementation from aten/src/ATen/cuda to c10/cuda. This makes P2P and fabric access queries available at the c10 layer without requiring ATen dependencies. The ATen layer now provides thin wrappers that ensure CUDA lazy initialization before forwarding to c10. This separation allows lower-level CUDA code to query P2P capabilities without pulling in ATen context machinery. Reviewed By: ngimel Differential Revision: D91506414
Summary: Refactor PeerToPeerAccess by moving the core implementation from aten/src/ATen/cuda to c10/cuda. This makes P2P and fabric access queries available at the c10 layer without requiring ATen dependencies. The ATen layer now provides thin wrappers that ensure CUDA lazy initialization before forwarding to c10. This separation allows lower-level CUDA code to query P2P capabilities without pulling in ATen context machinery. Reviewed By: ngimel Differential Revision: D91506414
1ef4deb to
f342c43
Compare
Summary: Refactor PeerToPeerAccess by moving the core implementation from aten/src/ATen/cuda to c10/cuda. This makes P2P and fabric access queries available at the c10 layer without requiring ATen dependencies. The ATen layer now provides thin wrappers that ensure CUDA lazy initialization before forwarding to c10. This separation allows lower-level CUDA code to query P2P capabilities without pulling in ATen context machinery. Reviewed By: ngimel Differential Revision: D91506414
Summary: Refactor PeerToPeerAccess by moving the core implementation from aten/src/ATen/cuda to c10/cuda. This makes P2P and fabric access queries available at the c10 layer without requiring ATen dependencies. The ATen layer now provides thin wrappers that ensure CUDA lazy initialization before forwarding to c10. This separation allows lower-level CUDA code to query P2P capabilities without pulling in ATen context machinery. Reviewed By: ngimel Differential Revision: D91506414
Summary: Refactor PeerToPeerAccess by moving the core implementation from aten/src/ATen/cuda to c10/cuda. This makes P2P and fabric access queries available at the c10 layer without requiring ATen dependencies. The ATen layer now provides thin wrappers that ensure CUDA lazy initialization before forwarding to c10. This separation allows lower-level CUDA code to query P2P capabilities without pulling in ATen context machinery. Reviewed By: ngimel Differential Revision: D91506414
Summary: Refactor PeerToPeerAccess by moving the core implementation from aten/src/ATen/cuda to c10/cuda. This makes P2P and fabric access queries available at the c10 layer without requiring ATen dependencies. The ATen layer now provides thin wrappers that ensure CUDA lazy initialization before forwarding to c10. This separation allows lower-level CUDA code to query P2P capabilities without pulling in ATen context machinery. Reviewed By: ngimel Differential Revision: D91506414
Summary: Refactor PeerToPeerAccess by moving the core implementation from aten/src/ATen/cuda to c10/cuda. This makes P2P and fabric access queries available at the c10 layer without requiring ATen dependencies. The ATen layer now provides thin wrappers that ensure CUDA lazy initialization before forwarding to c10. This separation allows lower-level CUDA code to query P2P capabilities without pulling in ATen context machinery. Reviewed By: ngimel Differential Revision: D91506414
Summary: Refactor PeerToPeerAccess by moving the core implementation from aten/src/ATen/cuda to c10/cuda. This makes P2P and fabric access queries available at the c10 layer without requiring ATen dependencies. The ATen layer now provides thin wrappers that ensure CUDA lazy initialization before forwarding to c10. This separation allows lower-level CUDA code to query P2P capabilities without pulling in ATen context machinery. Reviewed By: ngimel Differential Revision: D91506414
Summary: Refactor PeerToPeerAccess by moving the core implementation from aten/src/ATen/cuda to c10/cuda. This makes P2P and fabric access queries available at the c10 layer without requiring ATen dependencies. The ATen layer now provides thin wrappers that ensure CUDA lazy initialization before forwarding to c10. This separation allows lower-level CUDA code to query P2P capabilities without pulling in ATen context machinery. Reviewed By: ngimel Differential Revision: D91506414
Summary: Refactor PeerToPeerAccess by moving the core implementation from aten/src/ATen/cuda to c10/cuda. This makes P2P and fabric access queries available at the c10 layer without requiring ATen dependencies. The ATen layer now provides thin wrappers that ensure CUDA lazy initialization before forwarding to c10. This separation allows lower-level CUDA code to query P2P capabilities without pulling in ATen context machinery. Reviewed By: ngimel Differential Revision: D91506414
Summary: Refactor PeerToPeerAccess by moving the core implementation from aten/src/ATen/cuda to c10/cuda. This makes P2P and fabric access queries available at the c10 layer without requiring ATen dependencies. The ATen layer now provides thin wrappers that ensure CUDA lazy initialization before forwarding to c10. This separation allows lower-level CUDA code to query P2P capabilities without pulling in ATen context machinery. Reviewed By: ngimel Differential Revision: D91506414
|
@pytorchbot merge (Initiating merge automatically since Phabricator Diff has merged) |
Merge failedReason: This PR needs a If not, please add the To add a label, you can comment to pytorchbot, for example For more information, see Details for Dev Infra teamRaised by workflow job |
This PR needs a
|
|
@pytorchbot label "topic: not user facing" |
|
closing for rebase |
Refactor PeerToPeerAccess by moving the core implementation from aten/src/ATen/cuda to c10/cuda. This makes P2P and fabric access queries available at the c10 layer without requiring ATen dependencies. The ATen layer now provides thin wrappers that ensure CUDA lazy initialization before forwarding to c10. This separation allows lower-level CUDA code to query P2P capabilities without pulling in ATen context machinery. Original diff by @minsii #173571 Bifferential Revision: [D92675476](https://our.internmc.facebook.com/intern/diff/D92675476/) Pull Request resolved: #174582 Approved by: https://github.com/Skylion007
Refactor PeerToPeerAccess by moving the core implementation from aten/src/ATen/cuda to c10/cuda. This makes P2P and fabric access queries available at the c10 layer without requiring ATen dependencies. The ATen layer now provides thin wrappers that ensure CUDA lazy initialization before forwarding to c10. This separation allows lower-level CUDA code to query P2P capabilities without pulling in ATen context machinery. Original diff by @minsii pytorch#173571 Bifferential Revision: [D92675476](https://our.internmc.facebook.com/intern/diff/D92675476/) Pull Request resolved: pytorch#174582 Approved by: https://github.com/Skylion007
Summary:
Refactor PeerToPeerAccess by moving the core implementation from
aten/src/ATen/cuda to c10/cuda. This makes P2P and fabric access
queries available at the c10 layer without requiring ATen dependencies.
The ATen layer now provides thin wrappers that ensure CUDA lazy
initialization before forwarding to c10. This separation allows
lower-level CUDA code to query P2P capabilities without pulling in
ATen context machinery.
Reviewed By: ngimel
Differential Revision: D91506414