Conversation
```swift
    dims: headDim, base: config.ropeTheta, traditional: false,
    scalingConfig: config.ropeScaling,
    maxPositionEmbeddings: config.maxPositionEmbeddings)
}
```
Picking up changes after the initial port: ml-explore/mlx-lm@714157b...main
```swift
        return suScaledRope(x, offset: offset)
    }
    return x
}
```
```swift
} else {
    let (cachedKeys, cachedValues) = cache.update(keys: keys, values: values)
    // TODO dkoski
    // print("\(cachedKeys.shape) \(cachedValues.shape) \(queries.shape), \(mask.masks?[0].shape ?? [])")
```
WIP debug stuff :-)
```swift
    _ action: @Sendable (isolated ModelContainer) async throws -> sending R
) async rethrows -> sending R {
    try await action(self)
}
```
@DePasqualeOrg FYI, trying some different things out re: your recent cleanups around Sendable and thread safety. I have some tests that reproduce some threading issues (based on the LLMBasic example I made).
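For context, here is a minimal sketch of the pattern the `perform` signature above enables: an actor hands its own isolation to a caller-supplied closure. The `ModelContainer.perform` shape is from this PR; the caller and the `incrementGenerations` helper are hypothetical, added only to illustrate why the `isolated` parameter matters.

```swift
// Sketch: an actor that runs a caller-supplied closure inside its own isolation.
// Because the parameter is `isolated ModelContainer`, the closure body executes
// on the actor and can touch actor state synchronously; `sending R` lets the
// result safely leave the actor's isolation region.
actor ModelContainer {
    private var generationCount = 0

    func perform<R>(
        _ action: @Sendable (isolated ModelContainer) async throws -> sending R
    ) async rethrows -> sending R {
        try await action(self)
    }

    func incrementGenerations() { generationCount += 1 }
}

// Hypothetical caller: inside the closure, `container` is actor-isolated,
// so mutating its state needs no extra await and cannot race.
func example(container: ModelContainer) async {
    await container.perform { container in
        container.incrementGenerations()
    }
}
```

The design point is that callers get arbitrary multi-step access to the container's state under a single isolation hop, instead of many small awaits that could interleave with other tasks.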
```swift
import XCTest

/// Tests for the streamlined API using real models
public class ChatSessionTests: XCTestCase {
```
@DePasqualeOrg FYI, moved this into an IntegrationTests directory -- I am not sure these should run on CI, since the models involved are rather large, but I think the tests are valuable to run locally.
That makes sense. I thought about that when I modified this test, but I didn't realize that it could be excluded from CI.
```swift
let result = try await session.respond(to: "What is 2+2? Reply with just the number.")
print("One-shot result:", result)
XCTAssertTrue(result.contains("4") || result.lowercased().contains("four"))

func testChatSessionAsyncInterrupt() async throws {
```
@DePasqualeOrg FYI an example of some concurrency issues related to the issues you were working on.
This triggers a variety of crashes:

- thread safety -- hold lock while calling stream sync (mlx-swift#323)
- [BUG] gemma3text crashes if the attention mask is used (#27)
- a couple of others without issues filed, where the streaming response is still running for a short time after the loop terminates early and we end up with concurrent modification of the KVCache

I will use this to test actual fixes.
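The interrupt scenario can be sketched roughly like this. This is a hypothetical test shape, not the PR's actual test body: `respond(to:)` appears in the snippets above, but the `streamResponse(to:)` name and the session setup are assumptions.

```swift
// Sketch of the interrupt pattern: consume a few chunks of a streaming
// response, then break out of the loop early. If the underlying generation
// task is not cancelled promptly, it keeps mutating the shared KVCache while
// the next request starts -- the concurrent-modification crash described above.
func testChatSessionAsyncInterrupt() async throws {
    let session = ChatSession(Self.llmContainer)

    var chunks = 0
    for try await _ in session.streamResponse(to: "Tell me a long story.") {
        chunks += 1
        if chunks >= 3 { break }  // early exit: generation should be cancelled here
    }

    // Immediately issue another request; before the fixes this raced with the
    // still-running generation left over from the interrupted stream.
    _ = try await session.respond(to: "What is 2+2?")
}
```

The value of the test is the window between `break` and the next `respond`: any generation work that survives the loop exit shares mutable cache state with the new request.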
```swift
    Self.llmContainer, instructions: "You are a helpful assistant. Keep responses brief.")

@MainActor
func testViewModel() async throws {
    let model = ChatModel(model: model())
```
And this one simulates the activity from LLMBasic, which also causes thread safety issues.
- ml-explore/mlx-swift-examples#454
- fixes #27
- move ChatSession integration tests into a new test target so we can more easily control when it runs
- make a ChatSession _unit_ (more or less) test
- fix Sendable / thread safety issues uncovered by LLMBasic
- collect TestTokenizer and friends in its own file; fix warnings in tests
- UserInputProcessors -> structs
I think this work is complete - I am going to split it into a couple of PRs with a little more focus.
add a minimal LLM chat example + switch to mlx-swift 0.30.2 (mlx-swift-examples#454)

- fixes [BUG] gemma3text crashes if the attention mask is used (#27)
- move ChatSession integration tests into a new test target so we can more easily control when it runs
- make a ChatSession unit (more or less) test
- fix Sendable / thread safety issues uncovered by LLMBasic
Note that this requires changes in mlx-swift (so likely a new tag there):
Proposed changes

Please include a description of the problem or feature this PR is addressing. If there is a corresponding issue, include the issue #.

Checklist

Put an `x` in the boxes that apply.

- [ ] I have run `pre-commit run --all-files` to format my code / installed pre-commit prior to committing changes