[Feature] Adding pip install Support for sgl-kernel for ROCm by RohitNagraj · Pull Request #1 · hubertlu-tw/sglang

RohitNagraj · 2025-09-24T03:24:01Z

Motivation

To enable pip install sgl-kernel support for ROCm.

Test package is uploaded on Test PyPi here: sgl-kernel-rocm630 and sgl-kernel-rocm700.

Modifications

Added sgl-kernel/CMakeLists_rocm.txt that would be used by CMake for building wheel for ROCm, similar to NVIDIA's CMakeLists.txt.
Added sgl-kernel/build_rocm.sh similar to NVIDIA's sgl-kernel/build.sh that builds the ROCm wheel inside a docker image (used by Github Workflows).
Added sgl-kernel/rename_wheels_rocm.sh similar to existing NVIDIA's sgl-kernel/rename_wheels.sh to rename wheels to the standard format.
Added sgl-kernel/rocm_hipify.py that hipifies the sources using PyTorch's built in hipify module. This is required by CMake for build. Did not use hipify-clang inside CMakeLists_rocm.txt as it requires CUDA to be available.
Modified scripts/ci/amd_ci_install_dependency.sh to use sgl-kernel-rocm<version> hosted on TestPyPi for CI.
Updated .github/workflows/release-whl-kernel.yml to build and push ROCm 6.3 and ROCm 7.0 wheels to SGLang's index (https://docs.sglang.ai/whl/)
Added ROCm support to scripts/ci/update_kernel_whl_index.py to update sgl-kernel wheel index.

Testing

Environments:
- ROCm 6.3 on MI300x (using rocm/sgl-dev:v0.5.3rc0-rocm630-mi30x-20250930)
- ROCm 7.0 on MI300x (using rocm/sgl-dev:v0.5.3rc0-rocm700-mi30x-20250930)
- ROCm 7.0 on MI350x (using rocm/sgl-dev:v0.5.3rc0-rocm700-mi35x-20250930)

Procedure

Run the docker container using scripts/ci/amd_ci_start_container.sh (modifying the image name when required)
Install dependencies with scripts/ci/amd_ci_install_dependency.sh (using sgl-kernel from TestPyPi as the changes show in this PR. )
Run sgl-kernel unit tests from .github/workflows/pr-test-amd.yml using the command docker exec -w /sglang-checkout/sgl-kernel/tests ci_sglang python3 -m pytest test_moe_align.py test_moe_topk_softmax.py test_apply_token_bitmask_inplace.py test_activation.py test_kvcacheio.py speculative/test_eagle_utils.py
Run a small E2E test test/srt/test_mla.py using the command docker exec -w /sglang-checkout/test/srt ci_sglang bash -c "SGLANG_AMD_CI=1 SGLANG_IS_IN_CI=1 SGLANG_USE_AITER=1 python3 run_suite.py --suite per-commit-amd --range-begin 46 --range-end 47". The Range picks only test_mla.py to run.

Checklist

Format your code according to the Format code with pre-commit.
Add code support to build ROCm wheels for sgl-kernel
Test pip install functionality inside docker for ROCm 6.3 and ROCm 7.0 on MI300x and MI350x.
Run e2e test using test_mla.py.
Add wheel release as part of the Github Workflow.
Test functionality on Python 3.10, 3.11, 3.12 and 3.13 with install from TestPyPi.
Update documentation according to Write documentations.

hubertlu-tw · 2025-09-24T03:34:54Z

+#!/bin/bash
+set -ex
+
+DOCKER_IMAGE=lmsysorg/sglang:v0.5.3rc0-rocm630-mi30x


This will require regular updates. Any better way to handle it?

True. Ideally we should have a sglang:rocm-latest tag on dockerhub. But since that wasn't available, I took inspiration from Dockerfile.rocm to hardcode the latest. Any suggestions to do it better?

Is this script for building docker images for CI test? If yes, please check how NVIDIA’s CI test sgl_kernel: https://github.com/sgl-project/sglang/blob/main/.github/workflows/pr-test.yml

Yes. This is for CI build. In NVIDIA's build.sh, they use PyTorch image as the base. However, for AMD, we need dependencies like AITER installed, ROCm/PyTorch would not work out of the box as a base-image. Let me change the base-image to PyTorch and install the dependencies within the build script. That way, we avoid hardcoding the image to a specific version.

Did you look into how we run SGLang CI in the confluence page or the scripts in SGLang specifically these two scripts:

https://github.com/sgl-project/sglang/blob/main/.github/workflows/pr-test-amd.yml#L317-L345

https://github.com/sgl-project/sglang/blob/main/scripts/ci/amd_ci_install_dependency.sh

For just sgl-kernel tests, there is no aiter dependency. However, we need to find a way to handle it for

pip install --upgrade pip pip install uv uv pip install "sglang[all_hip]>=0.5.3rc0"

My bad, the build_rocm.sh is not for CI but for release-whl-kernel.yml github workflow. For CI, the install happens in amd_ci_install_dependency.sh, and just changing a small part in it to use the wheel from pip is enough once the wheel is on pip. However, this build_rocm.sh script is for release-whl-kernel.yml, which needs a docker image to compile and build the wheel.

NVIDIA's equivalent is found in build.sh that is called in the release-whl-kernel.yml.

Turns out we can use rocm/pytorch as the base image for it. So I'll change the base-image to that.

For

pip install --upgrade pip pip install uv uv pip install "sglang[all_hip]>=0.5.3rc0"

once we have the sgl-kernel wheel on Pypi, it should be pretty simple.

The current flow of how I think about this is:

This PR (sets up scripts to build wheel)

Setup a PyPi repo for official sgl-kernel-rocm

Setup Github Workflow similar to release-whl-kernel.yml for ROCm.

Update the CI scripts amd_ci_install_dependency.sh and sglang/python/pyproject.toml to use these wheels from PyPi with pip install.

hubertlu-tw · 2025-09-26T18:23:19Z

@RohitNagraj please mention sgl-kernel pip install in your PR's title.

RohitNagraj · 2025-10-16T18:00:20Z

Tagged torch versions to build kernel.

Remove python/pyproject_rocm.toml and adjust docs/platforms/amd_gpu.md. These files were accidentally included from draft sgl-project#14802 and cause unnecessary cross-platform CI runs.

…w_asdict (sgl-project#13782)

…#15340) Co-authored-by: Thomas Wang <1am9trash@gmail.com>

…loading on MI325 (sgl-project#13760) Co-authored-by: Sabre Shao <sabre.shao@amd.com> Co-authored-by: Yusheng (Ethan) Su <yushengsu.thu@gmail.com> Co-authored-by: Hubert Lu <Hubert.Lu@amd.com> Co-authored-by: xsun <sunxiao04@gmail.com>

Co-authored-by: Liangsheng Yin <lsyincs@gmail.com>

Co-authored-by: bingxche <Bingxu.Chen@amd.com>

Co-authored-by: Sai Enduri <saimanas.enduri@amd.com>

RohitNagraj changed the title ~~Adding pip install support for ROCm with pre-built wheels~~ [Feature] Adding pip install support for ROCm with pre-built wheels Sep 24, 2025

RohitNagraj changed the title ~~[Feature] Adding pip install support for ROCm with pre-built wheels~~ [Feature] Adding pip install support for ROCm Sep 24, 2025

hubertlu-tw requested changes Sep 24, 2025

View reviewed changes

RohitNagraj closed this Sep 25, 2025

RohitNagraj reopened this Sep 26, 2025

RohitNagraj force-pushed the rocm-pip-install-dev branch 2 times, most recently from 2553eaf to 3ec449a Compare September 26, 2025 18:12

RohitNagraj changed the title ~~[Feature] Adding pip install support for ROCm~~ [Feature] Adding sgl-kernel Wheel Build Support for ROCm Sep 26, 2025

RohitNagraj changed the title ~~[Feature] Adding sgl-kernel Wheel Build Support for ROCm~~ [Feature] Adding pip install Support for sgl-kernel for ROCm Sep 26, 2025

RohitNagraj changed the title ~~[Feature] Adding pip install Support for sgl-kernel for ROCm~~ [1/2] [Feature] Adding pip install Support for sgl-kernel for ROCm Sep 28, 2025

RohitNagraj force-pushed the rocm-pip-install-dev branch from 68de54b to 6e94500 Compare October 10, 2025 19:59

RohitNagraj changed the title ~~[1/2] [Feature] Adding pip install Support for sgl-kernel for ROCm~~ [Feature] Adding pip install Support for sgl-kernel for ROCm Oct 15, 2025

RohitNagraj force-pushed the rocm-pip-install-dev branch 2 times, most recently from 5da632b to 622c2bd Compare October 21, 2025 23:36

github-actions Bot added documentation Improvements or additions to documentation sgl-kernel amd labels Nov 10, 2025

RohitNagraj mentioned this pull request Nov 10, 2025

[WIP] [Feature] Add pyproject_rocm.toml for end-to-end ROCm pip installation support #3

Closed

5 tasks

github-actions Bot added model-gateway dependencies Multi-modal diffusion lora quant npu blackwell deepseek hicache labels Dec 9, 2025

merrymercy and others added 20 commits December 18, 2025 23:06

Update readme (sgl-project#15425)

f228b66

Add customized sampler registration (sgl-project#15423)

1739409

Added ROCm wheel build files for sgl-kernel

b349c28

Updated torch version used for rocm sgl-kernel wheel

69431d9

Removed redundant code

13a66ce

Added AMDGPU_TARGET env variable

e0b687b

Fixed indentation

20f4c0b

Renamed rocm630 to rocm640

a84f587

Updated wheel name from rocm 6.3 to 6.4

6f04705

Fixed merge conflict

2a2239c

Updated sources to match latest

f0e9d98

Updated env variable name to match the new change

aa26e9d

Added logic to create multiple directories for sglang/whl index

8ae26bc

Removed rocm640 support

79944d8

Added pyproject_rocm.toml

5a4ee3d

Updated docs

423bae9

Updated docs to remove PREBUILD_KERNELS=1 as default

4a3971d

Removed rocm640 support

9858cba

Align to torch/torchvision in current docker images

ed2e83c

Revert unintended ROCm packaging changes

3607d66

Remove python/pyproject_rocm.toml and adjust docs/platforms/amd_gpu.md. These files were accidentally included from draft sgl-project#14802 and cause unnecessary cross-platform CI runs.

akao-amd force-pushed the rocm-pip-install-dev branch from 8fc198a to 3607d66 Compare December 19, 2025 07:14

cocoshe and others added 8 commits December 19, 2025 15:19

[diffusion] refactor: refactor _build_req_from_sampling to use shallo…

0e869f0

…w_asdict (sgl-project#13782)

[amd] Add deterministic all-reduce kernel for AMD (ROCm) (sgl-project…

f2d64e6

…#15340) Co-authored-by: Thomas Wang <1am9trash@gmail.com>

[sgl-kernel] chore: update deepgemm version (sgl-project#13402)

65c0985

fix: unreachable error check in retraction (sgl-project#15433)

fb17845

Co-authored-by: Liangsheng Yin <lsyincs@gmail.com>

[AMD] Fix and add accuracy-test-2-gpu-amd back (sgl-project#15415)

a21aa87

Co-authored-by: bingxche <Bingxu.Chen@amd.com>

[AMD] add unit-test-backend-8-gpu-amd back (sgl-project#15253)

af780c5

Co-authored-by: Sai Enduri <saimanas.enduri@amd.com>

Merge branch 'main' into rocm-pip-install-dev

db17719

hubertlu-tw closed this Apr 22, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature] Adding pip install Support for sgl-kernel for ROCm#1

[Feature] Adding pip install Support for sgl-kernel for ROCm#1
RohitNagraj wants to merge 171 commits intohubertlu-tw:mainfrom
RohitNagraj:rocm-pip-install-dev

RohitNagraj commented Sep 24, 2025 •

edited

Loading

Uh oh!

Uh oh!

hubertlu-tw Sep 24, 2025

Uh oh!

RohitNagraj Sep 24, 2025

Uh oh!

hubertlu-tw Sep 25, 2025

Uh oh!

RohitNagraj Sep 25, 2025 •

edited

Loading

Uh oh!

hubertlu-tw Sep 25, 2025

Uh oh!

RohitNagraj Sep 25, 2025

Uh oh!

RohitNagraj Sep 25, 2025

Uh oh!

hubertlu-tw commented Sep 26, 2025

Uh oh!

RohitNagraj commented Oct 16, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

20 participants

Conversation

RohitNagraj commented Sep 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

Modifications

Testing

Procedure

Checklist

Uh oh!

Uh oh!

hubertlu-tw Sep 24, 2025

Choose a reason for hiding this comment

Uh oh!

RohitNagraj Sep 24, 2025

Choose a reason for hiding this comment

Uh oh!

hubertlu-tw Sep 25, 2025

Choose a reason for hiding this comment

Uh oh!

RohitNagraj Sep 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

hubertlu-tw Sep 25, 2025

Choose a reason for hiding this comment

Uh oh!

RohitNagraj Sep 25, 2025

Choose a reason for hiding this comment

Uh oh!

RohitNagraj Sep 25, 2025

Choose a reason for hiding this comment

Uh oh!

hubertlu-tw commented Sep 26, 2025

Uh oh!

RohitNagraj commented Oct 16, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

20 participants

RohitNagraj commented Sep 24, 2025 •

edited

Loading

RohitNagraj Sep 25, 2025 •

edited

Loading