Update Qwen to Flash Attn and Fix Transformers Bug #1878
Merged
Description
Qwen was slow because it did not use flash attention even on hardware that supports it. In addition, a newer transformers release introduced a bug when loading LoRAs on top of the base weights. This PR fixes both issues, and it also standardizes the setup between Qwen 2.5 and Qwen 3.
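For context, here is a minimal sketch of how flash attention can be enabled conditionally when the hardware and installed packages support it. This is illustrative only and not the code from this PR; the model id, capability check, and fallback behavior are assumptions.

```python
import torch
from transformers import AutoModelForCausalLM

model_name = "Qwen/Qwen2.5-7B"  # hypothetical model id for illustration

# Fall back to the default attention implementation unless flash attention
# is actually usable: an Ampere-or-newer GPU plus the flash-attn package.
attn_implementation = "eager"
if torch.cuda.is_available() and torch.cuda.get_device_capability()[0] >= 8:
    try:
        import flash_attn  # noqa: F401  (only checking availability)
        attn_implementation = "flash_attention_2"
    except ImportError:
        pass

model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype=torch.bfloat16,
    attn_implementation=attn_implementation,
)
```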
Type of change
How has this change been tested? Please provide a test case or an example of how you tested the change.
Tested locally.
Any specific deployment considerations?
No
Docs