[Model] Add HyperCLOVAX-SEED-Think-14B language model support by bigshanedogg · Pull Request #37107 · vllm-project/vllm

bigshanedogg · 2026-03-15T14:40:07Z

Purpose

Add inference support for HyperCLOVA X (HyperCLOVAXForCausalLM), a large language model family developed by NAVER Cloud.

https://huggingface.co/naver-hyperclovax/HyperCLOVAX-SEED-Think-14B

Changes

vllm/model_executor/models/hyperclovax.py (new) — HyperCLOVAXForCausalLM model implementation
vllm/transformers_utils/configs/hyperclovax.py (new) — HyperCLOVAXConfig configuration class
vllm/model_executor/models/registry.py — Register HyperCLOVAXForCausalLM
vllm/transformers_utils/configs/__init__.py — Register HyperCLOVAXConfig
docs/models/supported_models.md — Add HyperCLOVAXForCausalLM entry
tests/models/registry.py — Add test registry entry (naver-hyperclovax/HyperCLOVAX-SEED-Think-14B)
tests/models/language/generation/test_common.py — Add HyperCLOVAXForCausalLM to common generation tests

Test Plan

Launch server

  vllm serve naver-hyperclovax/HyperCLOVAX-SEED-Think-14B \
    --max-model-len 32768 \
    --max-num-batched-tokens 16384 \
    --tensor-parallel-size 1 \
    --trust-remote-code \
    --enable-prefix-caching

Test Result

Benchmark validation

Tasks	Metric	vLLM (this PR)
hellaswag	acc_norm	0.6521
gsm8k	flexible-extract	0.9484

Evaluated with lm-evaluation-harness defaults and default sampling params for server validation.

Request

client

import requests

payload = {
    "messages": [
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "Please briefly explain what you can help with. Think carefully before answering."},
            ],
        }
    ],
    "temperature": 0.2,
    "skip_special_tokens": False,
    "stop": ["<|im_end|><|endofturn|>", "<|im_end|><|stop|>"],
    "chat_template_kwargs": {"skip_reasoning": True},
}

resp = requests.post(
    f"http://{url}/v1/chat/completions", 
    json=payload, 
    timeout=300,
)
resp.raise_for_status()

data = resp.json()
print(data["choices"][0]["message"].get("content"))

output

Okay, the user is asking me to briefly explain what I can help with. Let me start by recalling my capabilities. I know I can answer questions, provide explanations, assist with learning, help brainstorm ideas, and offer suggestions. But I should make sure not to overstate what I can do.

Wait, I should also mention that I can't access real-time information or perform physical actions. That's important to set the right expectations. Maybe start by listing the main areas: answering questions, explaining concepts, helping with tasks like writing or coding, and offering recommendations. But keep it concise since they asked for a brief explanation.

Hmm, should I include examples? The user might appreciate a quick list of specific areas. Like, "I can help with homework, language translation, coding problems, creative writing, and more." Also, clarify that I rely on existing knowledge up to my last update in July 2024. Oh right, and I can't browse the internet or access personal data unless shared in the conversation. Privacy is a key point here.

Wait, the user said "think carefully before answering," so maybe I should structure it clearly. Start with a general statement about assisting with information and tasks, then list key areas, mention limitations, and ensure it's all in a few short sentences. Let me check if I missed anything. Oh, yes, I should avoid jargon and keep it simple. Alright, time to put it all together concisely.<|im_end|>
<|im_start|>assistant
I can assist with providing information, explanations, and guidance across a wide range of topics, including:  
- **Answering questions** (science, history, technology, etc.).  
- **Explaining concepts** (math, programming, philosophy, etc.).  
- **Helping with tasks** (writing, editing, coding, problem-solving).  
- **Offering recommendations** (books, learning resources, strategies).  
- **Brainstorming ideas** (creative projects, studies, discussions).  

**Limitations**: I cannot access real-time data, perform physical actions, or retrieve personal information unless shared during our conversation. My knowledge is current up to July 2024. Let me know how I can assist! 😊

mergify · 2026-03-15T14:40:55Z

Documentation preview: https://vllm--37107.org.readthedocs.build/en/37107/

gemini-code-assist

Code Review

This pull request adds support for the HyperCLOVAX-SEED-Think-14B language model. The changes include a new model implementation, a corresponding configuration class, and updates to the model registries, documentation, and tests. The new implementation handles the model's specific architectural features, such as muP scaling and optional Peri-Layer Normalization. The code is well-structured and follows existing patterns in the vLLM codebase. One issue was found in the test registry update, where a redundant and incorrect entry was added.

DarkLight1337

Otherwise LGTM

Signed-off-by: bigshanedogg <bigshane319@gmail.com>

…roject#37107) Signed-off-by: bigshanedogg <bigshane319@gmail.com>

jp1924 · 2026-06-02T05:42:32Z

@bigshanedogg
Adding SEED-Think-14B and 32B is a good idea, but to use them properly, it need to implement reasoning and a tool parser, right?
However, those components are missing from the current vLLM.
The official documentation says to install a plugin, but since it’s installed as a separate dependency package, it’s quite inconvenient to use.
With the vLLM version upgrade, the import structure has changed, causing the plugin to throw a lot of errors.
So, I think work is underway to add the plugin’s reasoning and tool parser. Could you please go to this PR and leave a review?
#42366
#44171

…roject#37107) Signed-off-by: bigshanedogg <bigshane319@gmail.com> Signed-off-by: Matt Van Horn <455140+mvanhorn@users.noreply.github.com>

bigshanedogg requested review from DarkLight1337 and ywang96 as code owners March 15, 2026 14:40

mergify Bot added documentation Improvements or additions to documentation new-model Requests to new models labels Mar 15, 2026

gemini-code-assist Bot reviewed Mar 15, 2026

View reviewed changes

Comment thread tests/models/registry.py Outdated

DarkLight1337 reviewed Mar 16, 2026

View reviewed changes

Comment thread tests/models/language/generation/test_common.py Outdated

DarkLight1337 approved these changes Mar 16, 2026

View reviewed changes

bigshanedogg added 3 commits March 16, 2026 04:51

feat: hyperclovax_seed_think_14b

d6ac93d

Signed-off-by: bigshanedogg <bigshane319@gmail.com>

fix: remove irrelevant model in test registry

1f8ff6e

Signed-off-by: bigshanedogg <bigshane319@gmail.com>

fix: min_gb in test_common

71bc1b9

Signed-off-by: bigshanedogg <bigshane319@gmail.com>

bigshanedogg force-pushed the feat/hyperclovax branch from fe13838 to 71bc1b9 Compare March 16, 2026 04:52

DarkLight1337 enabled auto-merge (squash) March 16, 2026 05:00

github-actions Bot added the ready ONLY add when PR is ready to merge/full CI is needed label Mar 16, 2026

DarkLight1337 merged commit 2390d44 into vllm-project:main Mar 16, 2026
55 checks passed

Lucaskabela pushed a commit to Lucaskabela/vllm that referenced this pull request Mar 17, 2026

[Model] Add HyperCLOVAX-SEED-Think-14B language model support (vllm-p…

264dd11

…roject#37107) Signed-off-by: bigshanedogg <bigshane319@gmail.com>

wendyliu235 pushed a commit to wendyliu235/vllm-public that referenced this pull request Mar 18, 2026

[Model] Add HyperCLOVAX-SEED-Think-14B language model support (vllm-p…

347fe5c

…roject#37107) Signed-off-by: bigshanedogg <bigshane319@gmail.com>

This was referenced Mar 23, 2026

Add HyperCLOVAX SEED Think 14B huggingface/transformers#44956

Merged

Add HyperCLOVA X SEED Think 14B huggingface/transformers#44957

Open

khairulkabir1661 pushed a commit to khairulkabir1661/vllm that referenced this pull request Mar 27, 2026

[Model] Add HyperCLOVAX-SEED-Think-14B language model support (vllm-p…

9537bb7

…roject#37107) Signed-off-by: bigshanedogg <bigshane319@gmail.com>

JiantaoXu pushed a commit to JiantaoXu/vllm that referenced this pull request Mar 28, 2026

[Model] Add HyperCLOVAX-SEED-Think-14B language model support (vllm-p…

5418017

…roject#37107) Signed-off-by: bigshanedogg <bigshane319@gmail.com>

mtparet pushed a commit to blackfuel-ai/vllm that referenced this pull request Apr 9, 2026

[Model] Add HyperCLOVAX-SEED-Think-14B language model support (vllm-p…

2a3c577

…roject#37107) Signed-off-by: bigshanedogg <bigshane319@gmail.com>

tarekziade mentioned this pull request May 7, 2026

Add HyperCLOVAX SEED Think 14B tarekziade/tarekziade-transformers-reviewer-test#15

Open

6 tasks

mystous pushed a commit to mystous/vllm_hybrid that referenced this pull request May 10, 2026

[Model] Add HyperCLOVAX-SEED-Think-14B language model support (vllm-p…

2698583

…roject#37107) Signed-off-by: bigshanedogg <bigshane319@gmail.com>

my-other-github-account pushed a commit to my-other-github-account/vllm that referenced this pull request May 15, 2026

[Model] Add HyperCLOVAX-SEED-Think-14B language model support (vllm-p…

e96c515

…roject#37107) Signed-off-by: bigshanedogg <bigshane319@gmail.com>

my-other-github-account pushed a commit to my-other-github-account/vllm that referenced this pull request May 15, 2026

[Model] Add HyperCLOVAX-SEED-Think-14B language model support (vllm-p…

8ab97d8

…roject#37107) Signed-off-by: bigshanedogg <bigshane319@gmail.com>

0826joyce pushed a commit to 0826joyce/vllm-serving-optimization that referenced this pull request May 19, 2026

[Model] Add HyperCLOVAX-SEED-Think-14B language model support (vllm-p…

1364a10

…roject#37107) Signed-off-by: bigshanedogg <bigshane319@gmail.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Model] Add HyperCLOVAX-SEED-Think-14B language model support#37107

[Model] Add HyperCLOVAX-SEED-Think-14B language model support#37107
DarkLight1337 merged 3 commits into
vllm-project:mainfrom
bigshanedogg:feat/hyperclovax

bigshanedogg commented Mar 15, 2026 •

edited by github-actions Bot

Loading

Uh oh!

mergify Bot commented Mar 15, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

Uh oh!

Uh oh!

DarkLight1337 left a comment

Uh oh!

Uh oh!

jp1924 commented Jun 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

bigshanedogg commented Mar 15, 2026 • edited by github-actions Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Changes

Test Plan

Launch server

Test Result

Benchmark validation

Request

client

output

Uh oh!

mergify Bot commented Mar 15, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

DarkLight1337 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

jp1924 commented Jun 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

bigshanedogg commented Mar 15, 2026 •

edited by github-actions Bot

Loading