Update `rope_scaling` to `rope_parameters` in preparation for Transformers v5 by hmellor · Pull Request #28542 · vllm-project/vllm

hmellor · 2025-11-12T10:09:30Z

In Transformers v5:

rope_scaling is now called rope_parameters
rope_theta now lives inside rope_parameters
rope_parameters may be nested for models which have different RoPE parameters for each layer type (i.e. Gemma & ModernBERT)

This PR adds forward compatibility for Transformesr v5 RoPE config by:

Moving any found config.rope_scaling to config.rope_parameters
Moving any found config.rope_theta to config.rope_parameters.rope_theta
Performs parch_rope_parameters on all nested configs if present
Performs patch_rope_parameters_dict on all nested RoPE parameters if present
Globally renaming rope_scaling to rope_parameters
get_rope:
- Remove base as an argument because it no longer needs to be passed separately
- If rope_parameters is None, default to rope base of 10000 which seems to be a universal default
Any models which do not use this 10000 default have it set using the set_default_rope_theta helper

Note, the errors triggered by disable_sliding_window when used with rope scaling models have been removed. It's been left as a follow up task remove disable_sliding_window completely as it is no longer relevant.

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

mergify · 2025-11-12T10:10:09Z

Documentation preview: https://vllm--28542.org.readthedocs.build/en/28542/

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

…one` Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

…rmers v5 (vllm-project#28542) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

1. fix vllm-project/vllm#28542 The model structure modifications we involved in are: - Qwen2.5-VL(still exist some patch) - Qwen2-VL - Qwen2 - DeepSeek series - Qwen-moe series 2. fix vllm-project/vllm#29121 the output token now type changed from np to `list[list[int]]` 3. fix vllm-project/vllm#29262 `xformers` backend for multimodal now has been deprecated 4. fix vllm-project/vllm#29342 5. fix vllm-project/vllm#28579 6. fix vllm-project/vllm#28718 7. fix vllm-project/vllm#28665 8. fix vllm-project/vllm#26847 vllm introduced the `optimization-level`, some default config has been changed, and the param `--enforce-eager` has been deprecated 9. fix http://github.com/vllm-project/vllm/pull/29223 it retuns tuple for sampler. 10. fix vllm-project/vllm#29471 we'll remove the related patch to avoid this kind of error. Co-authored-by: hfadzxy <starmoon_zhang@163.com> Co-authored-by: wangli <wangli858794774@gmail.com> - vLLM version: v0.11.2 --------- Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com> Signed-off-by: wangli <wangli858794774@gmail.com> Signed-off-by: hfadzxy <starmoon_zhang@163.com> Co-authored-by: wangli <wangli858794774@gmail.com> Co-authored-by: hfadzxy <starmoon_zhang@163.com>

1. fix vllm-project/vllm#28542 The model structure modifications we involved in are: - Qwen2.5-VL(still exist some patch) - Qwen2-VL - Qwen2 - DeepSeek series - Qwen-moe series 2. fix vllm-project/vllm#29121 the output token now type changed from np to `list[list[int]]` 3. fix vllm-project/vllm#29262 `xformers` backend for multimodal now has been deprecated 4. fix vllm-project/vllm#29342 5. fix vllm-project/vllm#28579 6. fix vllm-project/vllm#28718 7. fix vllm-project/vllm#28665 8. fix vllm-project/vllm#26847 vllm introduced the `optimization-level`, some default config has been changed, and the param `--enforce-eager` has been deprecated 9. fix http://github.com/vllm-project/vllm/pull/29223 it retuns tuple for sampler. 10. fix vllm-project/vllm#29471 we'll remove the related patch to avoid this kind of error. Co-authored-by: hfadzxy <starmoon_zhang@163.com> Co-authored-by: wangli <wangli858794774@gmail.com> - vLLM version: v0.11.2 --------- Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com> Signed-off-by: wangli <wangli858794774@gmail.com> Signed-off-by: hfadzxy <starmoon_zhang@163.com> Co-authored-by: wangli <wangli858794774@gmail.com> Co-authored-by: hfadzxy <starmoon_zhang@163.com> Signed-off-by: Che Ruan <cr623@ic.ac.uk>

1. fix vllm-project/vllm#28542 The model structure modifications we involved in are: - Qwen2.5-VL(still exist some patch) - Qwen2-VL - Qwen2 - DeepSeek series - Qwen-moe series 2. fix vllm-project/vllm#29121 the output token now type changed from np to `list[list[int]]` 3. fix vllm-project/vllm#29262 `xformers` backend for multimodal now has been deprecated 4. fix vllm-project/vllm#29342 5. fix vllm-project/vllm#28579 6. fix vllm-project/vllm#28718 7. fix vllm-project/vllm#28665 8. fix vllm-project/vllm#26847 vllm introduced the `optimization-level`, some default config has been changed, and the param `--enforce-eager` has been deprecated 9. fix http://github.com/vllm-project/vllm/pull/29223 it retuns tuple for sampler. 10. fix vllm-project/vllm#29471 we'll remove the related patch to avoid this kind of error. Co-authored-by: hfadzxy <starmoon_zhang@163.com> Co-authored-by: wangli <wangli858794774@gmail.com> - vLLM version: v0.11.2 --------- Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com> Signed-off-by: wangli <wangli858794774@gmail.com> Signed-off-by: hfadzxy <starmoon_zhang@163.com> Co-authored-by: wangli <wangli858794774@gmail.com> Co-authored-by: hfadzxy <starmoon_zhang@163.com>

1. fix vllm-project/vllm#28542 The model structure modifications we involved in are: - Qwen2.5-VL(still exist some patch) - Qwen2-VL - Qwen2 - DeepSeek series - Qwen-moe series 2. fix vllm-project/vllm#29121 the output token now type changed from np to `list[list[int]]` 3. fix vllm-project/vllm#29262 `xformers` backend for multimodal now has been deprecated 4. fix vllm-project/vllm#29342 5. fix vllm-project/vllm#28579 6. fix vllm-project/vllm#28718 7. fix vllm-project/vllm#28665 8. fix vllm-project/vllm#26847 vllm introduced the `optimization-level`, some default config has been changed, and the param `--enforce-eager` has been deprecated 9. fix http://github.com/vllm-project/vllm/pull/29223 it retuns tuple for sampler. 10. fix vllm-project/vllm#29471 we'll remove the related patch to avoid this kind of error. Co-authored-by: hfadzxy <starmoon_zhang@163.com> Co-authored-by: wangli <wangli858794774@gmail.com> - vLLM version: v0.11.2 --------- Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com> Signed-off-by: wangli <wangli858794774@gmail.com> Signed-off-by: hfadzxy <starmoon_zhang@163.com> Co-authored-by: wangli <wangli858794774@gmail.com> Co-authored-by: hfadzxy <starmoon_zhang@163.com> Signed-off-by: tanqingshan (A) <50050625@china.huawei.com>

1. fix vllm-project/vllm#28542 The model structure modifications we involved in are: - Qwen2.5-VL(still exist some patch) - Qwen2-VL - Qwen2 - DeepSeek series - Qwen-moe series 2. fix vllm-project/vllm#29121 the output token now type changed from np to `list[list[int]]` 3. fix vllm-project/vllm#29262 `xformers` backend for multimodal now has been deprecated 4. fix vllm-project/vllm#29342 5. fix vllm-project/vllm#28579 6. fix vllm-project/vllm#28718 7. fix vllm-project/vllm#28665 8. fix vllm-project/vllm#26847 vllm introduced the `optimization-level`, some default config has been changed, and the param `--enforce-eager` has been deprecated 9. fix http://github.com/vllm-project/vllm/pull/29223 it retuns tuple for sampler. 10. fix vllm-project/vllm#29471 we'll remove the related patch to avoid this kind of error. Co-authored-by: hfadzxy <starmoon_zhang@163.com> Co-authored-by: wangli <wangli858794774@gmail.com> - vLLM version: v0.11.2 --------- Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com> Signed-off-by: wangli <wangli858794774@gmail.com> Signed-off-by: hfadzxy <starmoon_zhang@163.com> Co-authored-by: wangli <wangli858794774@gmail.com> Co-authored-by: hfadzxy <starmoon_zhang@163.com>

hmellor added 6 commits November 12, 2025 09:19

Rename rope_scaling -> rope_parameters in get_rope

a62c2df

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

Patch rope parameters to new name, rope_parameters

f42b03d

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

Update models where it's a simple rename

a2a9437

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

Fix model config overrides

fba5bf5

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

Update examples

ee5cf66

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

Update benchmarks

080530d

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

mergify bot added documentation Improvements or additions to documentation llama Related to Llama models performance Performance-related issues qwen Related to Qwen models gpt-oss Related to GPT-OSS models speculative-decoding labels Nov 12, 2025

github-project-automation bot added this to gpt-oss Issues & Enhancements Nov 12, 2025

github-project-automation bot moved this to To Triage in gpt-oss Issues & Enhancements Nov 12, 2025

hmellor added 11 commits November 12, 2025 11:12

More renaming in transformers utils

889b900

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

Fix patch_rope_parameters for when rope_scaling was explicitly `N…

50b1a87

…one` Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

Update Gemma3 and Gemma3n

bd182e0

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

Merge branch 'main' into update-rope-config

4c61e2e

Get rope_theta from the new location too

65c8658

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

Fix condition for non gemma3 models

5d65739

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

Make Transformers backend torch compile check work with new rope params

b4e1967

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

Re-enable a load of Transformers nightly tests which are now fixed

ee77bd7

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

Update the custom configs

df4c007

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

Make sure scaling factor always exists

325ff8d

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

A couple more models that now init on v5

11c23a7

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

mergify bot added the ci/build label Nov 13, 2025

hmellor added 3 commits November 13, 2025 12:42

Update Commandr

4ea113c

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

Update Qwen3Next

59b0f27

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

Update Olmo2

064441b

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

hmellor added 5 commits November 18, 2025 13:46

Merge branch 'main' into update-rope-config

540a46b

Fix get_rope kwargs in vision transformers

a60b5ec

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

Update new model

00f2853

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

Missed positional args

717a704

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

Fix nemotron config validation

a9fa3b0

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

vllm-bot merged commit a8b7030 into vllm-project:main Nov 19, 2025
55 of 57 checks passed

github-project-automation bot moved this from To Triage to Done in gpt-oss Issues & Enhancements Nov 19, 2025

hmellor deleted the update-rope-config branch November 19, 2025 18:32

DarkLight1337 mentioned this pull request Nov 20, 2025

[Bugfix] Fix Plamo3 rope handling #29092

Merged

5 tasks

hl475 mentioned this pull request Nov 20, 2025

[CI Failure] Fix Gemma3 RoPE configuration for sliding attention layers #29111

Merged

5 tasks

juliendenize mentioned this pull request Nov 21, 2025

Fix mistral config #29172

Merged

5 tasks

devpatelio pushed a commit to SumanthRH/vllm that referenced this pull request Nov 29, 2025

Update rope_scaling to rope_parameters in preparation for Transfo…

4bca638

…rmers v5 (vllm-project#28542) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

Potabk mentioned this pull request Nov 29, 2025

[Main] Upgrade vllm commit to 2025_12_01 vllm-project/vllm-ascend#4527

Closed

wangxiyuan mentioned this pull request Dec 1, 2025

upgrade vLLM to main vllm-project/vllm-ascend#4608

Merged

kitaekatt pushed a commit to kitaekatt/vllm that referenced this pull request Dec 1, 2025

Update rope_scaling to rope_parameters in preparation for Transfo…

a458031

…rmers v5 (vllm-project#28542) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

hmellor mentioned this pull request Feb 13, 2026

Update to transformers v5 #30566

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Update `rope_scaling` to `rope_parameters` in preparation for Transformers v5#28542

Update `rope_scaling` to `rope_parameters` in preparation for Transformers v5#28542
vllm-bot merged 73 commits intovllm-project:mainfrom
hmellor:update-rope-config

hmellor commented Nov 12, 2025 •

edited by github-actions bot

Loading

Uh oh!

mergify bot commented Nov 12, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

hmellor commented Nov 12, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mergify bot commented Nov 12, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

hmellor commented Nov 12, 2025 •

edited by github-actions bot

Loading