DiffusionGemma Open Model Drops via Google and NVIDIA
Autoregressive token generation has a built-in bottleneck: every token has to wait for the one before it. Google DeepMind and NVIDIA have teamed up to release DiffusionGemma, an experimental model that drops the standard token-by-token approach and generates text in large parallel blocks. If you are building local agents, this could meaningfully change your development pipeline.
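To make the contrast concrete, here is a toy sketch that counts model calls for token-by-token decoding versus block-wise parallel decoding. It is not DiffusionGemma's algorithm or API: `toy_next_token`, `toy_denoise`, `block_size`, and `per_step` are hypothetical stand-ins invented for illustration, and the "model" simply reads from a fixed sentence.

```python
MASK = "_"
# Stand-in for the text a model "wants" to produce (10 tokens).
TARGET = "the quick brown fox jumps over the lazy dog today".split()


def toy_next_token(prefix):
    """Hypothetical autoregressive model: one call yields exactly one token."""
    return TARGET[len(prefix)]


def autoregressive_decode(n):
    """Generate n tokens one at a time; each token costs one model call."""
    out, calls = [], 0
    while len(out) < n:
        out.append(toy_next_token(out))
        calls += 1
    return out, calls


def toy_denoise(block, start, per_step):
    """Hypothetical denoiser: one call fills up to per_step masked slots."""
    block = list(block)
    filled = 0
    for i, tok in enumerate(block):
        if tok == MASK and filled < per_step:
            block[i] = TARGET[start + i]
            filled += 1
    return block


def block_parallel_decode(n, block_size, per_step):
    """Start each block fully masked and refine it until no masks remain."""
    out, calls = [], 0
    for start in range(0, n, block_size):
        size = min(block_size, n - start)
        block = [MASK] * size
        while MASK in block:
            block = toy_denoise(block, start, per_step)
            calls += 1
        out.extend(block)
    return out, calls


if __name__ == "__main__":
    _, ar_calls = autoregressive_decode(10)
    _, bp_calls = block_parallel_decode(10, block_size=5, per_step=3)
    print(ar_calls)  # 10 sequential calls
    print(bp_calls)  # 4 calls: 2 blocks x 2 refinement passes
```

In this toy setup the block-wise path needs fewer sequential calls because each pass resolves several positions at once. How that trade-off plays out in a real model depends on how many refinement passes it needs per block, which this sketch does not attempt to measure.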