Caltech x Ironsite

Spatial Intelligence in the Physical World

April 24th – April 26th

Vision Data Captured Directly From Job Sites


Your Mission If You Choose to Accept It:

Pinpoint a key spatial task where current models fail. Find a visual task that is trivial for humans, but AI models completely fail at. Use existing models like Gemini 2.5 Pro, Claude Opus, or GPT5 to give them an image or video and ask them a question that reveals their lack of spatial intelligence. We will provide API credits for you to use.

Create an innovative solution to solve the problem. Utilize prompt engineering, simple fine-tuning, inference time compute strategies, or any other clever techniques you can think of to improve how well the models can perform on that task. Be detailed and scientifically minded.

Demo your technique on a real-world problem. Your idea doesn’t have to solve the problem perfectly, but rather show how these models can be augmented to increase their spatial intelligence, even for a narrow use case or task. It’s even better if the task you solve could have real world impact.

The Details


Image Alignment 300x200

Prize Pool:

5,000 first place, 2,000 second, 1,500 third

Image Alignment 300x200

Timeline:

April 24th – April 26th

36 Hours of Innovation

Image Alignment 300x200

Teams:

Teams of 1-4.

Image Alignment 300x200

Why Apply:

This is your chance to work on a frontier problem, get noticed, and win big.

Meet the Ironsite Team


Keenan Brekke – FDE

Former Superintendent of Pacific Structures and has led 100+ crews across SF-Bay Area projects.

Daniele More – CSO

Former research tech lead at Google DeepMind and head of model research at the AI-HW startup Etched.

Charu Thomas – CTO

Founder and CEO of OX ($20m raised), an AI-powered wearable for frontline workers.

Max Mona – Founder

A second-generation construction builder and South Park Commons Founder Fellow.