Watch & Learn

Teach AI agents new skills by showing, not writing.

Started at the Gemini x TED AI Hackathon (Oct 2025)

Watch & Learn transforms screen recordings into "skills" for AI agents. Instead of writing tedious documentation, simply record yourself performing a task with narration. The system automatically extracts step-by-step instructions, automation scripts, templates, and reference assets—converting tacit knowledge into structured formats AI agents can follow. Deploy skills as downloadable ZIP files, MCP servers, or run computer-use automations.

The core insight: demonstration captures nuance that written instructions miss, making on-the-job agentic training faster and more reliable.

The Problem

Writing detailed prompts and instructions for AI agents is time-consuming and tedious. Complex workflows require lengthy documentation that's hard to maintain and often misses crucial details that are obvious when you actually perform the task.

Our Solution

Watch & Learn converts screen recordings into executable skill packages. Simply record yourself performing a task with light narration, and our system automatically generates:

SKILL.md - Step-by-step instructions AI agents can follow
Scripts - Automation code extracted from your demonstration
Templates - Configuration files and boilerplate
Assets - Reference screenshots and outputs

These skill packages can be downloaded as zip files, converted into MCP servers for AI integration, or executed directly in the browser via Browserbase.

How It Works

Upload a screen recording (or YouTube URL) showing the task
Our system extracts key frames and transcribes narration
Gemini 2.5 Computer Use model analyzes the video and generates a structured skill package
Review the extracted skill with a real-time thinking trace
Download, integrate, or test-run your new AI skill

Smart Caching: Identical videos are deduplicated by hash, providing instant results for popular tutorials and reducing processing costs.

Tech Stack

Frontend: Next.js • TypeScript • Tailwind CSS • Clerk • Supabase • AWS S3

Backend: Python • FastAPI • Gemini 2.5 Computer Use • Browserbase • Playwright

Project Structure

This is a monorepo containing both the frontend and backend:

show-ai/
├── api/              # Python backend (FastAPI + Browser Automation)
├── src/              # Next.js frontend
├── public/           # Static assets
└── package.json      # Frontend dependencies

Development Setup

Frontend

# Install dependencies
npm install

# Run development server
npm run dev

Open http://localhost:3000 to view the app.

Backend API

See api/README.md for detailed setup instructions.

Quick start:

# Navigate to API directory
cd api

# Set up Python environment
python3 -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt

# Install Playwright
playwright install chrome

# Set environment variables
export GEMINI_API_KEY="your-key"
export BROWSERBASE_API_KEY="your-key"
export BROWSERBASE_PROJECT_ID="your-project-id"

# Run FastAPI server
uvicorn api_server:app --reload --port 8000

The API will be available at http://localhost:8000

Name		Name	Last commit message	Last commit date
Latest commit History 39 Commits
.cursor/rules		.cursor/rules
api		api
distributed_systems/video_processor		distributed_systems/video_processor
example-skills		example-skills
logos		logos
prompts		prompts
public		public
src		src
.gitignore		.gitignore
CLERK_SETUP.md		CLERK_SETUP.md
CLERK_SUPABASE_INTEGRATION.md		CLERK_SUPABASE_INTEGRATION.md
LICENSE		LICENSE
README.md		README.md
S3_SETUP.md		S3_SETUP.md
STRIPE_SETUP.md		STRIPE_SETUP.md
UNIFIED_VIDEO_UPLOAD_MIGRATION.md		UNIFIED_VIDEO_UPLOAD_MIGRATION.md
components.json		components.json
design-system.md		design-system.md
env.example		env.example
eslint.config.mjs		eslint.config.mjs
next-env.d.ts		next-env.d.ts
next.config.ts		next.config.ts
package-lock.json		package-lock.json
package.json		package.json
postcss.config.js		postcss.config.js
project-plan.md		project-plan.md
tailwind.config.js		tailwind.config.js
tsconfig.json		tsconfig.json
tsconfig.tsbuildinfo		tsconfig.tsbuildinfo

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Watch & Learn

The Problem

Our Solution

How It Works

Tech Stack

Project Structure

Development Setup

Frontend

Backend API

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Watch & Learn

The Problem

Our Solution

How It Works

Tech Stack

Project Structure

Development Setup

Frontend

Backend API

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages