[Model] Qwen3.5 support w/o vision, WIP #19456

Closed
pwilkin wants to merge 9 commits into ggml-org:master from pwilkin:qwen35
@pwilkin
Collaborator

@pwilkin pwilkin commented Feb 9, 2026

Follow-up to #19435.

Rebased on top of #18792 together with @ngxson's fix for chunking.

Comment on lines +118 to +126
q = ggml_cont_4d(ctx0, ggml_permute(ctx0, q, 0, 2, 1, 3), S_k, n_tokens, H_k, n_seqs);
k = ggml_cont_4d(ctx0, ggml_permute(ctx0, k, 0, 2, 1, 3), S_k, n_tokens, H_k, n_seqs);
v = ggml_cont_4d(ctx0, ggml_permute(ctx0, v, 0, 2, 1, 3), S_v, n_tokens, H_v, n_seqs);
if (is_kda) {
    g = ggml_cont_4d(ctx0, ggml_permute(ctx0, g, 0, 2, 1, 3), S_k, n_tokens, H_k, n_seqs);
} else {
    g = ggml_cont_4d(ctx0, ggml_permute(ctx0, g, 2, 0, 3, 1), n_tokens, 1, H_k, n_seqs);
}
beta = ggml_cont(ctx0, ggml_permute(ctx0, beta, 2, 0, 1, 3));
Member

These conts are about to be obsoleted by #19429?

Collaborator Author

Yep.
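
For context on the "conts" discussed in this thread: `ggml_permute` only rearranges a tensor's dimension metadata (extents and strides), and the wrapping `ggml_cont_4d` then copies the permuted view into contiguous memory. The standalone C++ sketch below illustrates that split. It is not ggml code: `View4`, `make_view`, `permute` and `cont` are hypothetical names invented for illustration, strides are counted in elements rather than bytes, and the `0, 2, 1, 3` argument order is assumed to mean "source dimension i becomes dimension axes[i]".

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for a 4-D tensor view (not a ggml type).
// ne = extent per dimension, nb = stride per dimension in elements.
struct View4 {
    const float *              data;
    std::array<std::size_t, 4> ne;
    std::array<std::size_t, 4> nb;
};

// Build a contiguous view over `buf`, with dimension 0 varying fastest.
// `buf` must outlive the returned view.
View4 make_view(const std::vector<float> & buf, std::array<std::size_t, 4> ne) {
    View4 v{buf.data(), ne, {}};
    v.nb[0] = 1;
    for (int i = 1; i < 4; ++i) {
        v.nb[i] = v.nb[i - 1] * ne[i - 1];
    }
    assert(buf.size() == v.nb[3] * ne[3]);
    return v;
}

// Metadata-only permutation: source dimension i becomes dimension axes[i].
// No element is moved, so the result is generally not contiguous.
View4 permute(const View4 & a, std::array<int, 4> axes) {
    View4 r = a;
    for (int i = 0; i < 4; ++i) {
        assert(axes[i] >= 0 && axes[i] < 4);
        r.ne[axes[i]] = a.ne[i];
        r.nb[axes[i]] = a.nb[i];
    }
    return r;
}

// The "cont" step: walk the strided view and copy it into dense memory.
// This copy is the extra work that the review comment is about.
std::vector<float> cont(const View4 & v) {
    std::vector<float> out;
    out.reserve(v.ne[0] * v.ne[1] * v.ne[2] * v.ne[3]);
    for (std::size_t i3 = 0; i3 < v.ne[3]; ++i3)
    for (std::size_t i2 = 0; i2 < v.ne[2]; ++i2)
    for (std::size_t i1 = 0; i1 < v.ne[1]; ++i1)
    for (std::size_t i0 = 0; i0 < v.ne[0]; ++i0) {
        out.push_back(v.data[i0 * v.nb[0] + i1 * v.nb[1] + i2 * v.nb[2] + i3 * v.nb[3]]);
    }
    return out;
}
```

With a source laid out as `[S, H, n_tokens, n_seqs]`, calling `permute(src, {0, 2, 1, 3})` yields a view shaped `[S, n_tokens, H, n_seqs]` that still points at the original buffer; only `cont` performs a copy. If downstream operators can consume the strided view directly, that copy becomes unnecessary, which appears to be the point of the question about #19429.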

Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
@pwilkin
Collaborator Author

pwilkin commented Feb 9, 2026

@ggerganov tested on MXFP4 quant for Coder-Next:

No syntax errors detected.

Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
@ggerganov
Member

I think the unification of the delta net implementations should be done separately from Qwen3.5 - likely in #18792.

I think this will likely take longer than the Qwen3.5 release, so it's better to add Qwen3.5 support without basing the implementation on that branch.

@ggerganov ggerganov marked this pull request as draft February 9, 2026 14:37
@pwilkin
Collaborator Author

pwilkin commented Feb 9, 2026

@ggerganov okay, I'll do it on master.

@github-actions github-actions bot added the model (Model specific) and python (python script changes) labels Feb 9, 2026
@pwilkin
Collaborator Author

pwilkin commented Feb 9, 2026

Aight, since we have an official PR (#19468), I'll gladly pass the baton over to @JJJYmmm :) Closing this one.

@pwilkin pwilkin closed this Feb 9, 2026
Labels

model (Model specific), python (python script changes)
