
Add shell script to deploy LLM service on AMD GPU #2321

Merged
cb-github-robot merged 4 commits into cloud-barista:main from leehyeoklee:improve-amd-gpu-llm-flow
Feb 23, 2026

Conversation

@leehyeoklee
Contributor

🚀 Key Changes

This update extends LLM inference and serving capabilities to AMD GPUs, in addition to the existing NVIDIA support.

We have added shell scripts to automate ROCm driver installation, vLLM/Ollama environment setup, and model serving (vLLM). This allows users in cloud environments (Azure, AWS) with AMD GPUs to deploy LLM services without complex manual configuration.

✨ Implementation Details

1. ROCm Driver Installation (installRocmDriver.sh)

  • Installs the AMD GPU driver and the ROCm (Radeon Open Compute) stack, which are essential for using AMD GPUs.
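
The script itself is not shown in the PR description; a minimal sketch of the usual ROCm install flow on Ubuntu might look like the following. The version numbers, repository URL layout, and `--usecase` flag are assumptions, not copied from the actual installRocmDriver.sh:

```shell
#!/bin/bash
# Hypothetical sketch of a ROCm driver install on Ubuntu 22.04 ("jammy").
# All versions and the URL layout are assumptions, not the PR's actual script
# (the PR reports testing with ROCm 7.0.1).
set -eu

ROCM_VER="6.2.2"             # assumed example release
INSTALLER_REV="6.2.60202-1"  # assumed amdgpu-install package revision

# Pure helper: build the amdgpu-install .deb URL for a release/revision.
rocm_installer_url() {
  printf 'https://repo.radeon.com/amdgpu-install/%s/ubuntu/jammy/amdgpu-install_%s_all.deb' "$1" "$2"
}

install_rocm() {
  local url
  url="$(rocm_installer_url "$ROCM_VER" "$INSTALLER_REV")"
  wget -q "$url" -O /tmp/amdgpu-install.deb
  sudo apt-get install -y /tmp/amdgpu-install.deb
  sudo amdgpu-install -y --usecase=rocm   # kernel driver + ROCm userspace
  sudo usermod -aG render,video "$USER"   # allow GPU access without root
}

# Only install when explicitly requested, so the file can be sourced
# for its helpers without touching the system.
if [ "${1:-}" = "--install" ]; then install_rocm; fi
```

A reboot is typically required after the driver install before `rocminfo` reports the GPU.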

2. vLLM Environment Deployment (deployvLLMAmd.sh)

  • Configures the vLLM environment using the official rocm/vllm Docker image provided by AMD.
  • Automates Docker installation and HuggingFace cache directory setup.
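
As a hedged sketch of those two steps (the image tag matches the `rocm/vllm` name given above, but the cache path and Docker bootstrap route are assumptions, not the actual deployvLLMAmd.sh):

```shell
#!/bin/bash
# Hypothetical sketch of the vLLM-on-AMD environment setup.
# Cache path and Docker install route are assumptions.
set -eu

VLLM_IMAGE="rocm/vllm"                             # official AMD-provided image (per the PR)
HF_CACHE="${HF_CACHE:-$HOME/.cache/huggingface}"   # assumed conventional HF cache path

# Ensure the HuggingFace cache directory exists so downloaded model weights
# survive container restarts; prints the path it guaranteed.
ensure_hf_cache() {
  mkdir -p "$1"
  printf '%s' "$1"
}

setup_environment() {
  # Install Docker via the upstream convenience script if it is missing.
  if ! command -v docker >/dev/null 2>&1; then
    curl -fsSL https://get.docker.com | sudo sh
  fi
  ensure_hf_cache "$HF_CACHE" >/dev/null
  sudo docker pull "$VLLM_IMAGE"
}

if [ "${1:-}" = "--deploy" ]; then setup_environment; fi
```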

3. vLLM Model Serving (servevLLMAmd.sh)

  • Launches a specified HuggingFace model as a vLLM-powered, OpenAI-compatible API server.
  • Runs in a Docker container and includes features for stable operation, such as automatic shutdown of existing servers and health checks.
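
The shutdown-then-serve-then-health-check sequence described above can be sketched as follows; the container name, port, device flags, and `vllm serve` invocation are assumptions about a typical rocm/vllm setup, not the actual servevLLMAmd.sh:

```shell
#!/bin/bash
# Hypothetical sketch of serving a HuggingFace model as an OpenAI-compatible
# API via the rocm/vllm image. Names, port, and flags are assumptions.
set -eu

CONTAINER="vllm-server"   # assumed container name
PORT="${PORT:-8000}"      # vLLM's default OpenAI-compatible API port

# Pure helper: assemble docker run arguments for a given model/port.
build_serve_args() {
  local model="$1" port="$2"
  printf -- '--name %s -p %s:8000 --device=/dev/kfd --device=/dev/dri rocm/vllm vllm serve %s' \
    "$CONTAINER" "$port" "$model"
}

stop_existing() {
  # Remove any previous server container so redeploys are idempotent.
  sudo docker rm -f "$CONTAINER" >/dev/null 2>&1 || true
}

wait_for_health() {
  # Poll vLLM's /health endpoint until the server answers or we time out.
  local tries=0
  until curl -sf "http://localhost:${PORT}/health" >/dev/null; do
    tries=$((tries + 1))
    if [ "$tries" -ge 60 ]; then return 1; fi
    sleep 5
  done
}

if [ "${1:-}" = "--serve" ]; then
  stop_existing
  # Word splitting of the helper's output into arguments is intentional here.
  sudo docker run -d $(build_serve_args "${2:?model id required}" "$PORT")
  wait_for_health
fi
```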

💻 Test Environment and Results

Azure (Radeon PRO V710)

  • Result: Works Perfectly
  • After installing the ROCm 7.0.1 driver, model serving and inference via vLLM/Ollama were confirmed to be running smoothly.

Ollama: [screenshot]

vLLM: [screenshot]

@seokho-son
Member

@leehyeoklee Let's check if the suggested scripts can be (simply) merged with the existing scripts. :)

  1. ROCm Driver Installation (installRocmDriver.sh)
  2. vLLM Environment Deployment (deployvLLMAmd.sh)
  3. vLLM Model Serving (servevLLMAmd.sh)

https://github.com/cloud-barista/cb-tumblebug/tree/main/scripts/usecases/llm

@seokho-son
Member

@leehyeoklee
Is this PR ready for additional review round?

@leehyeoklee leehyeoklee force-pushed the improve-amd-gpu-llm-flow branch from 66ef178 to 5742ef7 Compare February 23, 2026 07:08
@leehyeoklee
Contributor Author

leehyeoklee commented Feb 23, 2026

@seokho-son

Yes, it's ready for another round of review.😊
I have unified the vLLM deployment and serving scripts to support both NVIDIA and AMD GPUs.

Additionally, I’ve created a single installGpuDriver.sh script that handles both GPU driver and CUDA/ROCm installations.

And I've confirmed that LLM models load and run correctly on both NVIDIA and AMD GPU VMs!
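
The unified installGpuDriver.sh itself is not shown in the thread; one plausible way to branch on the GPU vendor (a sketch assuming `lspci` output is available, not the script's actual logic) is:

```shell
#!/bin/bash
# Hypothetical vendor-detection sketch for a unified installGpuDriver.sh;
# the real script's branching logic is not shown in the PR thread.
set -eu

# Pure helper: classify a GPU from lspci-style text (testable without hardware).
detect_gpu_vendor() {
  case "$1" in
    *NVIDIA*)              echo nvidia ;;
    *AMD*|*ATI*|*Radeon*)  echo amd ;;
    *)                     echo unknown ;;
  esac
}

install_gpu_driver() {
  local vendor
  vendor="$(detect_gpu_vendor "$(lspci | grep -iE 'vga|3d|display' || true)")"
  case "$vendor" in
    nvidia) echo "Installing NVIDIA driver + CUDA..." ;;  # e.g. nvidia-driver + cuda-toolkit
    amd)    echo "Installing AMD driver + ROCm..."    ;;  # e.g. amdgpu-install --usecase=rocm
    *)      echo "No supported GPU detected" >&2; return 1 ;;
  esac
}

if [ "${1:-}" = "--install" ]; then install_gpu_driver; fi
```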

@leehyeoklee
Contributor Author

For AMD vLLM deployment, I referred to this documentation: https://docs.vllm.ai/en/stable/getting_started/installation/gpu/

Note:
Since Python 3.12 is required to use the current pre-built wheels, I configured the script to install it when proceeding with an AMD GPU.
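
A hedged sketch of that requirement check follows; the deadsnakes PPA route is an assumption about how such an install could be done on Ubuntu, not necessarily what the script actually does:

```shell
#!/bin/bash
# Hypothetical sketch of a Python 3.12 requirement check; the deadsnakes PPA
# route is an assumption, not necessarily the PR script's approach.
set -eu

# Pure helper: true if dotted version $1 is >= dotted version $2.
version_ge() {
  [ "$(printf '%s\n%s\n' "$2" "$1" | sort -V | head -n1)" = "$2" ]
}

ensure_python312() {
  local cur
  cur="$(python3 --version 2>/dev/null | awk '{print $2}')"
  if [ -n "$cur" ] && version_ge "$cur" "3.12"; then
    return 0  # system python3 is already new enough
  fi
  sudo add-apt-repository -y ppa:deadsnakes/ppa
  sudo apt-get update
  sudo apt-get install -y python3.12 python3.12-venv
}

if [ "${1:-}" = "--ensure" ]; then ensure_python312; fi
```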

@seokho-son
Member

/approve

@github-actions github-actions bot added the approved This PR is approved and will be merged soon. label Feb 23, 2026
@cb-github-robot cb-github-robot merged commit 31cc77b into cloud-barista:main Feb 23, 2026
2 checks passed
@leehyeoklee leehyeoklee deleted the improve-amd-gpu-llm-flow branch February 23, 2026 08:02
