Gemma 4 12B in action: Object detection, function calling, voice command, segmentation, language switch, translation - all of this and much more without vision/audio encoders!
(Inputs and outputs are real, but FC2 data is shown as code, and generation is sped up)