Conversation
@corebonts not sure if you have time, but could you review or test this PR? I am struggling to get proper output from the model, and I don't see anything major that is incorrect in the code changes, but I might just not be able to spot it.
I can at least give it a try tomorrow. I hope I don't forget it :)
Thank you so much
I had a quick look, and I also got completely broken responses. I tried the 0.6B and 30B-A3B models from Ollama and the 0.6B bartowski quant from Hugging Face (just to double check).
Thanks for checking, that matches what I have as well
Sadly I did not make progress :/ I checked again, but I haven't found any problem. I even compared it with the qwen2 code, which is working, but did not find anything special there.
Same, I've looked pretty hard and even tried having some AI help, and I'm still not spotting anything. Not sure if there is some change upstream that is needed, but I quite frankly don't know what it would be
As long as llamafile has not rebased onto llama.cpp with minimal changes on top, it will be hard to know whether the problem is here or upstream.
Yes
They have changed the way self-attention is built in mainline.
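For context, a rough sketch of what that upstream change amounts to. This is an assumption based on llama.cpp #12828: Qwen3 appears to apply an RMSNorm to the Q and K projections per head ("QK-norm") before attention, where Qwen2 instead used a bias on the QKV projections. The function names, shapes, and the NumPy formulation below are illustrative only, not llama.cpp's actual ggml graph code.

```python
import numpy as np

def rms_norm(x, weight, eps=1e-6):
    # RMSNorm over the last axis: x / rms(x) * weight
    return x / np.sqrt(np.mean(x * x, axis=-1, keepdims=True) + eps) * weight

def qk_norm_attention(q, k, v, q_norm_w, k_norm_w):
    # q, k, v: (n_heads, seq_len, head_dim)
    # The extra step (vs. Qwen2): per-head RMSNorm on Q and K.
    # Skipping this (or placing it wrong relative to RoPE) would
    # plausibly produce the broken output described in this thread.
    head_dim = q.shape[-1]
    q = rms_norm(q, q_norm_w)
    k = rms_norm(k, k_norm_w)
    # Standard scaled dot-product attention with a numerically
    # stable softmax over the key dimension.
    scores = q @ k.transpose(0, 2, 1) / np.sqrt(head_dim)
    scores -= scores.max(axis=-1, keepdims=True)
    probs = np.exp(scores)
    probs /= probs.sum(axis=-1, keepdims=True)
    return probs @ v

rng = np.random.default_rng(0)
q = rng.standard_normal((2, 4, 8))
k = rng.standard_normal((2, 4, 8))
v = rng.standard_normal((2, 4, 8))
out = qk_norm_attention(q, k, v, np.ones(8), np.ones(8))
```

If this is the relevant difference, copying the new attention-graph construction from mainline (rather than patching the old Qwen2-style path) is the safer route.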
Thank you @ikawrakow, I will take a look and give it a try.

Edit: I did just copy/paste the necessary changes and it looks like they work with llamafile. Going to review a bit deeper.
Changes based on llama.cpp #12828.
Adds support for the Qwen3 and Qwen3MoE models. It looks like there will be more changes needed when the models are released.