Stack from ghstack (oldest at bottom):
Dr. CI: ✅ No failures as of commit 6e387dc with merge base c49a60e. Artifacts and rendered test results: hud.pytorch.org/pr/pytorch/ao/4192
Summary: Adds memory pinning support for `MXTensor` and `NVFP4Tensor`, this is important for supporting offloading in `diffusers`. Fixes the torchao side of #4026 Still requires huggingface/diffusers#13276 to land for things to work e2e Test Plan: e2e: makes https://gist.github.com/vkuzo/9ca863e559eb8af18f6e6afb079bf74f work with mxfp8 and nvfp4, requires huggingface/diffusers#13276 (not yet landed) local: the new tests added in this PR pass ghstack-source-id: ff0b1a2 ghstack-comment-id: 4142562671 Pull-Request: #4192
@claude review this please
Claude finished @andrewor14's task in 2m 4s
Code Review: Support pinning for mx and nvfp4 tensors
Overall this is a clean, straightforward PR. The pattern follows existing conventions in the codebase (e.g., Implementation ( The Implementation ( The Minor observations (non-blocking):
Tests Both test functions follow a solid pattern:
The NVFP4 test additionally parametrizes over
Verdict: LGTM. Clean implementation, correct constructor calls, good test coverage, and consistent with existing patterns in the codebase.
Summary:
Adds memory pinning support for `MXTensor` and `NVFP4Tensor`. This is important for supporting offloading in `diffusers`.

Fixes the torchao side of #4026.
Still requires huggingface/diffusers#13276 to land for things to work e2e.

cc @sayakpaul

Test Plan:
e2e: makes https://gist.github.com/vkuzo/9ca863e559eb8af18f6e6afb079bf74f work with mxfp8 and nvfp4; requires huggingface/diffusers#13276 (not yet landed)
local: the new tests added in this PR pass
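
For readers unfamiliar with what "pinning support" for a wrapper tensor usually involves, here is a minimal pure-Python sketch of the general pattern: pin each inner buffer, then rebuild the wrapper with the same metadata. This is an assumption about the general shape, not the code in this PR. `FakeBuffer`, `FakeQuantizedTensor`, and the `qdata`/`scale`/`block_size` fields are hypothetical stand-ins and do not use torch or torchao.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class FakeBuffer:
    """Hypothetical stand-in for a plain tensor; only tracks a pinned flag."""
    name: str
    pinned: bool = False

    def pin_memory(self) -> "FakeBuffer":
        # Return a new buffer marked as pinned; the original is unchanged.
        return replace(self, pinned=True)

    def is_pinned(self) -> bool:
        return self.pinned


@dataclass(frozen=True)
class FakeQuantizedTensor:
    """Hypothetical wrapper holding quantized data plus its scales."""
    qdata: FakeBuffer
    scale: FakeBuffer
    block_size: int  # illustrative metadata carried through unchanged

    def pin_memory(self) -> "FakeQuantizedTensor":
        # Pin every inner buffer, then rebuild the wrapper with the
        # same non-tensor metadata.
        return FakeQuantizedTensor(
            self.qdata.pin_memory(),
            self.scale.pin_memory(),
            self.block_size,
        )

    def is_pinned(self) -> bool:
        # The wrapper counts as pinned only if all inner buffers are.
        return self.qdata.is_pinned() and self.scale.is_pinned()


w = FakeQuantizedTensor(FakeBuffer("qdata"), FakeBuffer("scale"), block_size=32)
w_pinned = w.pin_memory()
```

The point of the sketch is that pinning has to reach every inner buffer and that metadata must survive the rebuild; a local test for such a change would typically check both.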