Delta net precision by angeloskath · Pull Request #997 · ml-explore/mlx-lm

angeloskath · 2026-03-14T03:32:03Z

Since both batch and vectorized go through the kernel there is almost 0 overhead for switching to an fp32 state.

Qwen/Qwen3.5-9B

Before
Averages: prompt_tps=1567.420, generation_tps=39.407, peak_memory=19.544

After
Averages: prompt_tps=1568.009, generation_tps=39.199, peak_memory=19.571

I haven't noticed any real difference in daily use with or without this update.

This does affect finetuning fairly heavily but I think we need a kernel for that to be an enjoyable experience anyway.

nastya236 · 2026-03-14T16:15:27Z

thank you! looks great!

angeloskath added 2 commits March 13, 2026 20:19

Switch state to fp32

db8d174

Add a test that fails with low precision state

77408de

angeloskath requested review from andresy and nastya236 March 14, 2026 03:32

nastya236 approved these changes Mar 14, 2026

View reviewed changes

angeloskath mentioned this pull request Mar 15, 2026

Nemotron super support #992

Merged

angeloskath merged commit 735a43b into main Mar 15, 2026
2 checks passed

angeloskath deleted the delta-net-precision branch March 15, 2026 22:39

SudarkinV mentioned this pull request Apr 27, 2026

Add Metal VJP kernel for gated_delta_update (trainable Qwen3.5 / Qwen3-Next LoRA on Apple Silicon) #1217

Open

somegeekintn mentioned this pull request May 27, 2026

Fix consistent float32 dtype for gated delta SSM operations in Qwen3.5 models ml-explore/mlx-swift-lm#317

Merged

4 tasks

ttupper92618 mentioned this pull request Jun 4, 2026

Reconcile Foxlight mlx-lm fixes onto upstream v0.31.3 Foxlight-Foundation/mlx-lm#1

Merged

tsato081 mentioned this pull request Jun 10, 2026

Add chunk-parallel gated delta ops for training #1389

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Delta net precision#997

Delta net precision#997
angeloskath merged 2 commits into
mainfrom
delta-net-precision

angeloskath commented Mar 14, 2026

Uh oh!

nastya236 commented Mar 14, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

angeloskath commented Mar 14, 2026

Uh oh!

nastya236 commented Mar 14, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants