Add OpenVLA model support by yongming-qin · Pull Request #29738 · vllm-project/vllm

yongming-qin · 2025-11-30T01:20:49Z

Purpose

Add support for OpenVLA model in vLLM. OpenVLA is a vision-language-action model that uses timm-based vision backbones (Prismatic architecture) with LLM backbones for action prediction tasks. This implementation follows the same pattern as DeepSeek-VL2, which also uses timm for vision processing.

This PR adds:

Model executor implementation (vllm/model_executor/models/openvla.py)
Configuration class (vllm/transformers_utils/configs/openvla.py)
Processor class (vllm/transformers_utils/processors/openvla.py)

The implementation supports:

Single and fused vision backbones using timm ViT models
Multiple LLM backbones (Llama-2, Mistral, Phi-3)
Image embedding insertion via prompt updates
Tensor parallelism support for vision backbone

FIX #14739

Test Plan

Basic inference test:

vllm serve openvla/openvla-7b --trust-remote-code

Test with image input:
- Use OpenAI-compatible API to send requests with image data
- Verify image embeddings are correctly processed and inserted
Compare outputs with HuggingFace implementation:
- Run inference on same inputs with both vLLM and HF implementations
- Verify output logits/tokens match

Test Result

[To be filled after testing]

yongming-qin · 2025-11-30T01:24:19Z

Note: Currenly the model can be loaded by vllm and we can use it to process image + text instruction. However, the results of vllm and Transformers are different. Comments and collaboration are welcome.

mergify · 2025-12-08T19:21:20Z

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @yongming-qin.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

…vision. Signed-off-by: Luke <yq0536@gmail.com>

…penvla-7b Signed-off-by: Luke <yq0536@gmail.com>

yongming-qin mentioned this pull request Nov 30, 2025

[New Model]:Can you support the VLA series models? For example, openVLA. #14739

Closed

1 task

mergify Bot added the new-model Requests to new models label Dec 2, 2025

yongming-qin force-pushed the support-openvla-v2 branch from 1b19006 to 2dc2fef Compare December 8, 2025 19:19

mergify Bot added the needs-rebase label Dec 8, 2025

PalmDr mentioned this pull request Jan 15, 2026

[Model] Add OpenVLA model support #32390

Open

4 tasks

yongming-qin added 2 commits January 16, 2026 11:53

Add files for openvla referring deepseek-vl2. They both use timm for …

204a17e

…vision. Signed-off-by: Luke <yq0536@gmail.com>

Add the registry and config files so that vllm can parse HF openvla/o…

f45fa49

…penvla-7b Signed-off-by: Luke <yq0536@gmail.com>

yongming-qin force-pushed the support-openvla-v2 branch from 2dc2fef to f45fa49 Compare January 16, 2026 19:56

mergify Bot removed the needs-rebase label Jan 16, 2026

yiwen101 mentioned this pull request May 13, 2026

[Model] Add OpenVLA model support #42534

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add OpenVLA model support#29738

Add OpenVLA model support#29738
yongming-qin wants to merge 2 commits into
vllm-project:mainfrom
yongming-qin:support-openvla-v2

yongming-qin commented Nov 30, 2025 •

edited by github-actions Bot

Loading

Uh oh!

yongming-qin commented Nov 30, 2025

Uh oh!

mergify Bot commented Dec 8, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

yongming-qin commented Nov 30, 2025 • edited by github-actions Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Test Plan

Test Result

Uh oh!

yongming-qin commented Nov 30, 2025

Uh oh!

mergify Bot commented Dec 8, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

yongming-qin commented Nov 30, 2025 •

edited by github-actions Bot

Loading