
[Bugfix] Fix ROCm UVA CPU weight offloading broken by #32993 (#34543)

Merged

vllm-bot merged 1 commit into vllm-project:main from ROCm:akaratza_fix_basic_tests on Feb 14, 2026

Conversation

@AndreasKaratzas (Collaborator) commented Feb 13, 2026

  • PR #32993 ([Feature] Support CPU Offloading without PyTorch Pinned Memory that leads to doubled allocation) added UVA (Unified Virtual Addressing) support for CPU weight offloading, but only included platform checks for CUDA and XPU in get_accelerator_view_from_cpu_tensor, causing a ValueError on ROCm.
  • The C++ layer already works on ROCm since cuda_view.cu is hipified automatically and the op registration is not ROCm-guarded. Only the Python-level platform check was missing.
  • Fix: add current_platform.is_rocm() to the existing CUDA branch, since ROCm shares the same device type and dispatch key (see the sketch after this list).
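
For context, here is a minimal sketch of the patched dispatch in vllm/utils/torch_utils.py. The branch structure follows the diff quoted in the review below; the CUDA-side op name and the fallback error are assumptions for illustration, not the verbatim upstream code.

```python
# Sketch only: the branch structure mirrors the PR diff; the CUDA-side
# op name and the fallback error text are illustrative assumptions.
import torch

from vllm.platforms import current_platform


def get_accelerator_view_from_cpu_tensor(cpu_tensor: torch.Tensor) -> torch.Tensor:
    """Return a zero-copy device view of a pinned CPU tensor via UVA."""
    assert cpu_tensor.is_pinned(), "CPU tensor must be pinned"
    if current_platform.is_xpu():
        return torch.ops._C.get_xpu_view_from_cpu_tensor(cpu_tensor)
    elif current_platform.is_cuda() or current_platform.is_rocm():
        # cuda_view.cu is hipified automatically on ROCm, so the same
        # custom op serves both CUDA and ROCm.
        return torch.ops._C.get_cuda_view_from_cpu_tensor(cpu_tensor)
    raise ValueError("UVA views are not supported on this platform")
```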

Test plan

  • tests/kernels/core/test_uva.py
  • tests/basic_correctness/test_cpu_offload.py
  • tests/quantization/test_cpu_offload.py
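
A minimal way to run this plan locally, assuming a vLLM dev checkout with pytest installed:

```python
# Hypothetical local runner for the PR's test plan; assumes a vLLM dev
# checkout with pytest available.
import pytest

pytest.main([
    "tests/kernels/core/test_uva.py",
    "tests/basic_correctness/test_cpu_offload.py",
    "tests/quantization/test_cpu_offload.py",
])
```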

…cpu_tensor

Signed-off-by: Andreas Karatzas <akaratza@amd.com>
@gemini-code-assist (bot) left a comment

Code Review

This pull request addresses a bug that prevented ROCm from using Unified Virtual Addressing (UVA) for CPU weight offloading. The fix is correct and straightforward, adding the necessary platform check for ROCm. I've included one suggestion to use an existing helper function, which will improve code maintainability.

Comment thread on vllm/utils/torch_utils.py:

```diff
 assert cpu_tensor.is_pinned(), "CPU tensor must be pinned"
 return torch.ops._C.get_xpu_view_from_cpu_tensor(cpu_tensor)
-elif current_platform.is_cuda():
+elif current_platform.is_cuda() or current_platform.is_rocm():
```

Severity: high

For better maintainability and to align with existing patterns in this file (e.g., in aux_stream), it's preferable to use the is_cuda_alike() helper method. This method encapsulates the check for both CUDA and ROCm platforms, making the code cleaner and easier to update if more CUDA-like platforms are supported in the future.

Suggested change:

```diff
-elif current_platform.is_cuda() or current_platform.is_rocm():
+elif current_platform.is_cuda_alike():
```
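
For reference, a minimal sketch of the semantics is_cuda_alike() encapsulates; vLLM's actual Platform interface may implement this differently:

```python
# Minimal sketch of the semantics of is_cuda_alike(); vLLM's actual
# Platform interface may implement this differently.
class Platform:
    def is_cuda(self) -> bool:
        raise NotImplementedError

    def is_rocm(self) -> bool:
        raise NotImplementedError

    def is_cuda_alike(self) -> bool:
        """True for platforms that behave like CUDA (CUDA itself and ROCm)."""
        return self.is_cuda() or self.is_rocm()
```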

@AndreasKaratzas (author) replied:

This is more declarative. I think it's better.

A collaborator replied:

@AndreasKaratzas let's follow the gemini suggestion and use elif current_platform.is_cuda_alike():. It is a known convention in vLLM: if something works for both CUDA and ROCm, we use current_platform.is_cuda_alike().

@njhill (Member) left a comment

Thanks @AndreasKaratzas!

I was also wondering about is_cuda_alike() since we use that elsewhere. But don't feel too strongly and would be good to fix the CI asap!

@njhill added the ready label (ONLY add when PR is ready to merge/full CI is needed) on Feb 13, 2026
@njhill enabled auto-merge (squash) on February 13, 2026 at 23:51
@AndreasKaratzas (author) commented Feb 13, 2026

> Thanks @AndreasKaratzas!
>
> I was also wondering about is_cuda_alike() since we use that elsewhere. But don't feel too strongly and would be good to fix the CI asap!

Yeah, I don't feel that strongly either tbh 😅 It's just that I've sometimes found more explicitly spelled-out conditionals useful, although cuda_alike is pretty well defined (at least at the moment). Tough one 😅

@AndreasKaratzas (author) commented:

@njhill Entrypoints Integration (Responses API) is not affected by this PR. It is a flaky TG and is addressed in #33949.

Btw, if you get time to review/approve #33949 as well, that would be great 😅

@AndreasKaratzas (author) commented:

@njhill V1 e2e + Engine is also a known failure I think.

@vllm-bot vllm-bot merged commit a0638d0 into vllm-project:main Feb 14, 2026
49 of 53 checks passed
@github-project-automation (bot) moved this from Todo to Done in AMD on Feb 14, 2026
@AndreasKaratzas AndreasKaratzas deleted the akaratza_fix_basic_tests branch February 14, 2026 04:10
jiangkuaixue123 pushed a commit to jiangkuaixue123/vllm that referenced this pull request Apr 28, 2026

Labels

bug: Something isn't working
ready: ONLY add when PR is ready to merge/full CI is needed
rocm: Related to AMD ROCm

Projects

Status: Done


4 participants