
Add RoPE array-offset overload (prep for continuous batching)#305

Merged
davidkoski merged 8 commits into ml-explore:main from PicoMLX:main
Jan 15, 2026

Conversation

@ronaldmannak (Contributor) commented Dec 1, 2025

Proposed changes

Add array offset support to MLXFast.RoPE to match the Python MLX API.

Dependencies

  • ml-explore/mlx-c#85 (https://github.com/ml-explore/mlx-c/pull/85): the underlying C binding for array-offset RoPE

Motivation

The Python mlx.core.fast.rope function accepts offset as either an int or an array.
This enables several important use cases:

  • Continuous batching: Different sequences in a batch at different positions
  • Speculative decoding: Verifying multiple candidate tokens at different positions in parallel
  • Sliding window attention: Processing long sequences in chunks starting at different offsets

Changes

  • Add MLXFast.RoPE(..., offset: MLXArray, ...) overload
  • Add MLXNN.RoPE.callAsFunction(_:offset:) overload
  • Add unit tests

Example

// Batch of 3 sequences at positions [50, 20, 0]
let offsets = MLXArray([50, 20, 0])
let result = MLXFast.RoPE(queries, dimensions: 64, traditional: false, base: 10000, scale: 1.0, offset: offsets)
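The per-sequence offsets can also be passed through the layer-level overload added in MLXNN. A minimal sketch, assuming the RoPE initializer parameters mirror the existing Int-offset API (the shapes and helper calls here are illustrative, not from the PR):

```swift
import MLX
import MLXNN

// Sketch only: exercises the new MLXNN.RoPE.callAsFunction(_:offset:) overload.
// queries shape: [batch = 3, heads = 8, seqLen = 16, headDim = 64]
let queries = MLXArray.zeros([3, 8, 16, 64])

let rope = RoPE(dimensions: 64, traditional: false, base: 10000)

// One starting position per sequence in the batch.
let offsets = MLXArray([50, 20, 0])
let rotated = rope(queries, offset: offsets)
```

Each sequence's positions are then computed as `offset[i] + 0..<seqLen`, which is what makes continuous batching possible: sequences at different decode positions share one batched RoPE call.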

Checklist

Put an x in the boxes that apply.

  • I have read the CONTRIBUTING document
  • I have run pre-commit run --all-files to format my code / installed pre-commit prior to committing changes
  • I have added tests that prove my fix is effective or that my feature works
  • I have updated the necessary documentation (if needed)

@davidkoski (Collaborator)

Requires https://github.com/ml-explore/mlx-c/pull/85: the underlying C binding and core support for array-offset RoPE must be merged first.

Ahh, missed that one. Likely Ronan will do that a different way as he has some automated tools for keeping the API in sync.

@ronaldmannak (Contributor, Author)

@davidkoski Ah, that explains the lack of newlines in MLX-c :) Happy to wait until it's exposed using the automated tools. Is there any ETA?

@davidkoski (Collaborator)

Is there any ETA?

Not sure, but I can ping Ronan.

@ronaldmannak changed the title from "Expose mlx_fast_rope_offset_array" to "Add RoPE array-offset overload (prep for continuous batching)" on Jan 7, 2026
@ronaldmannak (Contributor, Author)

@davidkoski I think this is ready to go.
Be sure to check the MLX and MLX-C dependencies. I believe there are some issues (unrelated to this PR) in newer commits.

@davidkoski (Collaborator)

@davidkoski I think this is ready to go. Be sure to check the MLX and MLX-C dependencies. I believe there are some issues (unrelated to this PR) in newer commits.

Yeah, I think this can probably fit in after #319, which should pick up the new mlx/mlx-c dependencies.

@ronaldmannak (Contributor, Author)

Sounds good!

@davidkoski (Collaborator)

@ronaldmannak I think this can be rebased -- I just merged the v0.30.1 update. I forgot and already cut a tag but I can cut a new one once this is in.

@davidkoski (Collaborator) commented Jan 8, 2026

Also, take a look at:

public protocol OffsetLayer: Module {
    func callAsFunction(_ x: MLXArray, offset: Int) -> MLXArray
}

Should this have an array offset method? Will it apply to all of the RoPE variants in mlx-swift-lm? (See ml-explore/mlx-swift-lm#29)

@ronaldmannak (Contributor, Author)

Hi David, sorry for the slow reply. I had to spend some time following the thread through the code :)

It looks like OffsetLayer came in via mlx-swift #322 and is now used quite a bit in mlx-swift-lm #29. Since my PR (#305) adds RoPE support for offset: MLXArray, I agree we now have a small API mismatch: the protocol only models offset: Int.

Possible ways forward:

  • Add an MLXArray-offset overload to OffsetLayer:
    func callAsFunction(_ x: MLXArray, offset: MLXArray) -> MLXArray
  • Or introduce a separate protocol (e.g. ArrayOffsetLayer) for layers that support per-sequence offsets, keeping OffsetLayer as-is.

Either way, to minimize churn in the big mlx-swift-lm #29 change set, I’m inclined to update this PR after #29 has merged (unless you’d prefer to align it sooner). What direction do you prefer, and should this apply across all RoPE variants in mlx-swift-lm?
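For reference, the separate-protocol option could look roughly like this. The `ArrayOffsetLayer` name comes from the discussion above; this is a hypothetical sketch, not the final merged API:

```swift
import MLX
import MLXNN

// Hypothetical sketch: a separate protocol so that only layers able to
// handle per-sequence offsets need to conform, leaving OffsetLayer as-is.
public protocol ArrayOffsetLayer: Module {
    /// - offset: one position offset per sequence in the batch
    func callAsFunction(_ x: MLXArray, offset: MLXArray) -> MLXArray
}
```

A RoPE layer could then conform to both protocols, forwarding the array-offset call to the new MLXFast.RoPE overload.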

@davidkoski (Collaborator)

Or introduce a separate protocol (e.g. ArrayOffsetLayer) for layers that support per-sequence offsets, keeping OffsetLayer as-is.

I was thinking about that too -- I think maybe a separate protocol is a better idea. That way we don't force all layers to implement it (or end up in a situation where they can't).

Timing-wise, that sounds good. I hope to get all that merged early next week.

@ronaldmannak (Contributor, Author)

Early next week sounds good. I’ll wait for the changes to land.

I’m not sure whether every implementation could realistically support the offset: MLXArray overload, so introducing a separate protocol (for layers that can handle per-sequence offsets) does seem like the safest option.

@ronaldmannak (Contributor, Author)

@davidkoski I've added the protocol. I think it's good to go.

@davidkoski (Collaborator)

hit a swift-format item

@ronaldmannak (Contributor, Author)

@davidkoski yep sorry, fixed.

@davidkoski (Collaborator) left a review comment


Looks great, thank you!

@davidkoski davidkoski merged commit 0bb133a into ml-explore:main Jan 15, 2026
7 checks passed