Pinned
Wrote a guide on how to setup this local coding agent on macOS.
"Running Gemma 4 26B-A4B and Qwen3.6 35B-A3B locally with llama.cpp, MTP speculative decoding, multimodal support, and PI as a coding agent."
This actually makes Gemma 4 26B-4A usable for a coding agent @ 72tk/s on my MacBook Pro M1 Max.
This video is realtime, running completely locally.
00:00
ikyle.me
How to Setup a Local Coding Agent on macOS
Running Gemma 4 26B-A4B and Qwen3.6 35B-A3B locally with llama.cpp, MTP speculative decoding, multimodal support, and PI as a coding agent.











