Pinned
worked on having gpt4o drive open-loop motor primitives like waving or grabbing from stereo vision. special tokens like <distance: 10cm> get streamed straight into the transcript stream, which gpt4o reacts too.
next up is integrating the closed-loop hand tracking. main challenge
00:00



