Apple Silicon + Gemma 4 fans: this is for you.
Pico AI Server now supports continuous batching with MLX-Swift.
43 tok/s on 1 stream.
26 tok/s per stream on 2 concurrent streams.
That’s 52 tok/s total, a 21% throughput gain over the single stream, on a six-year-old MacBook Pro M1 Max!
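
For anyone who wants to check the math, here is a minimal sketch of how the aggregate and the gain fall out of the numbers above. The helper names are made up for illustration and are not part of Pico AI Server or MLX-Swift; the only inputs are the three figures quoted in this post.

```python
def aggregate_throughput(per_stream_tok_s: float, streams: int) -> float:
    """Total tokens/second across all concurrent streams."""
    return per_stream_tok_s * streams


def throughput_gain(baseline_tok_s: float, batched_tok_s: float) -> float:
    """Relative gain of batched throughput over the single-stream baseline."""
    return (batched_tok_s - baseline_tok_s) / baseline_tok_s


# Figures quoted in the post (hypothetical variable names).
single_stream = 43.0                       # tok/s with 1 stream
batched = aggregate_throughput(26.0, 2)    # 26 tok/s per stream, 2 streams
gain = throughput_gain(single_stream, batched)

print(f"{batched:.0f} tok/s total, {gain:.0%} gain")
```

Each stream is slower on its own (26 vs. 43 tok/s), but the two together move more tokens per second, which is the point of batching.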