clarifai

deconstruct complex research papers into digestible concepts and automatically generate video explanations in the style of 3blue1brown.

demos

example clips generated for specific concepts.

- the weight monodromy conjecture
- word embeddings
- rnns vs cnns
- bellman's equations

features

pdf upload & analysis: upload research papers in pdf format for comprehensive ai-powered analysis.
key concept extraction: automatically identifies and extracts core concepts, methodologies, and insights from the text using google's gemini flash.
agentic video generation: a langchain agent uses manim to generate high-quality, 3blue1brown-style animations for each concept.
self-correcting code generation: the agent makes up to three attempts to generate and render manim code, analyzing the previous error to correct itself.
intelligent scene splitting: an initial ai call intelligently splits a complex concept into multiple thematic scenes to create a more structured and understandable video narrative.
parallel clip processing: video clips are rendered in parallel (batches of 3) for 3-4x faster generation.
multi-clip video stitching: successfully rendered video clips are automatically stitched together into a final, complete video using ffmpeg.
vercel blob storage: videos are automatically uploaded to vercel blob storage for persistent cdn-backed delivery.
resilient workflow: the video generation process is fault-tolerant; if a single scene fails to render after multiple attempts, it is skipped, and the final video is created from the successful scenes.
real-time progress tracking: a websocket connection provides live progress updates, with stage indicators and a simulated progress bar, during video generation.
api security: rate limiting (slowapi) and api key authentication to prevent abuse.
ai-powered code implementation: generate functional python code examples for any extracted concept.
responsive ui: a clean and responsive frontend built with next.js and tailwind css with webgl shader background.
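
the self-correcting and resilient behaviour described above can be sketched as a small retry loop. this is a minimal illustration, not the project's actual agent code: `generate_code` and `render` are hypothetical callables standing in for the gemini call and the manim render step, and the error is passed around as a plain string for simplicity.

```python
from typing import Callable, List, Optional

MAX_ATTEMPTS = 3  # "up to three attempts", as stated in the feature list


def render_with_retries(
    scene: str,
    generate_code: Callable[[str, Optional[str]], str],  # hypothetical: (scene, last error) -> manim code
    render: Callable[[str], str],  # hypothetical: manim code -> clip path, raises on failure
    max_attempts: int = MAX_ATTEMPTS,
) -> Optional[str]:
    """Return the rendered clip path, or None if every attempt fails."""
    last_error: Optional[str] = None
    for _ in range(max_attempts):
        # the previous error (if any) is fed back so the generator can correct itself
        code = generate_code(scene, last_error)
        try:
            return render(code)
        except Exception as exc:
            last_error = str(exc)
    return None


def render_all(scenes: List[str], generate_code, render) -> List[str]:
    """Render every scene and keep only the clips that succeeded."""
    clips = [render_with_retries(s, generate_code, render) for s in scenes]
    return [c for c in clips if c is not None]
```

a scene that still fails after the last attempt simply drops out of the list returned by `render_all`, so the final video can be built from whatever succeeded.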

tech stack

frontend: next.js 15, react 19, typescript, tailwind css, framer motion, webgl shaders
backend: fastapi, python 3.12, uvicorn, asyncio
ai/ml: google gemini 2.0 flash, langchain
video generation: manim community v0.19.0
video processing: ffmpeg
storage: vercel blob (production), local filesystem (development)
security: slowapi (rate limiting), api key authentication, cors
deployment: vercel (frontend), railway (backend), docker

prerequisites

before you begin, ensure you have the following dependencies installed on your system.

1. general

git: for cloning the repository.

2. backend dependencies

python 3.12: the application requires python 3.12 for both backend and agent (unified environment).
ffmpeg:
- macos: brew install ffmpeg
- linux: sudo apt-get update && sudo apt-get install ffmpeg (debian/ubuntu) or sudo pacman -S ffmpeg (arch)
- windows: choco install ffmpeg or scoop install ffmpeg
latex: required for manim text rendering
- macos: brew install --cask mactex-no-gui
- linux: sudo apt-get install texlive texlive-latex-extra texlive-fonts-recommended

3. frontend dependencies

node.js: version 18.x or later.
npm: usually installed with node.js.
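
a quick way to confirm the prerequisites above is a small check script. this is a sketch that is not part of the repository; it assumes the tools are exposed on PATH under the command names `git`, `ffmpeg`, `latex`, `node`, and `npm`, and it does not verify the node.js version.

```python
import shutil
import sys

# command-line tools named in the prerequisites (command names are assumptions)
REQUIRED_TOOLS = ["git", "ffmpeg", "latex", "node", "npm"]


def missing_tools(tools=REQUIRED_TOOLS, which=shutil.which):
    """Return the tools that cannot be found on PATH."""
    return [tool for tool in tools if which(tool) is None]


def python_version_ok(version_info=sys.version_info):
    """True when the interpreter is python 3.12, the version this readme asks for."""
    return tuple(version_info[:2]) == (3, 12)


if __name__ == "__main__":
    for tool in missing_tools():
        print(f"missing: {tool}")
    if not python_version_ok():
        print(f"python 3.12 expected, found {sys.version.split()[0]}")
```

running it with the interpreter you plan to use for the backend prints one line per missing tool and nothing when everything is in place.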

local development setup

clone the repository

git clone https://github.com/yourusername/clarifai
cd clarifai

backend setup

cd backend

# create virtual environment
python3 -m venv venv
source venv/bin/activate  # on windows: venv\Scripts\activate

# install dependencies
pip install -r requirements.txt
pip install -r agent_requirements.txt

# configure environment variables
cp .env.example .env
# edit .env and add your keys:
# GEMINI_API_KEY=your_gemini_api_key
# API_KEY=your_secret_api_key (optional for dev)
# ALLOWED_ORIGINS=http://localhost:3000,http://localhost:8000

# create storage directories
mkdir -p storage/uploads storage/videos clips videos

# start backend server
uvicorn app.main:app --reload --port 8000

frontend setup (in a new terminal)

cd frontend

# install dependencies
npm install

# configure environment variables
cp .env.example .env.local
# edit .env.local:
# NEXT_PUBLIC_API_KEY=your_secret_api_key (must match backend)
# NEXT_PUBLIC_API_URL=http://localhost:8000
# NEXT_PUBLIC_WS_URL=ws://localhost:8000

# start development server
npm run dev

access the application
- frontend: http://localhost:3000
- backend api: http://localhost:8000/docs

production deployment

vercel (frontend)

push your code to github
import project to vercel

set environment variables in vercel dashboard:

NEXT_PUBLIC_API_KEY=your_production_api_key
NEXT_PUBLIC_API_URL=https://your-backend.railway.app
NEXT_PUBLIC_WS_URL=wss://your-backend.railway.app

deploy

railway (backend)

create new project from github repo

set environment variables:

API_KEY=your_production_api_key
GEMINI_API_KEY=your_gemini_api_key
ALLOWED_ORIGINS=https://your-app.vercel.app
BLOB_READ_WRITE_TOKEN=vercel_blob_rw_xxxxxx

railway will auto-detect the dockerfile and deploy
videos will automatically upload to vercel blob for persistent storage

vercel blob setup

in vercel dashboard, go to storage → blob
create a new blob store
copy the BLOB_READ_WRITE_TOKEN
add it to railway environment variables

usage

open your web browser and navigate to the deployed url or http://localhost:3000.
upload a research paper using the drag-and-drop uploader.
wait for the ai analysis to complete. key concepts will appear on the page.
on any concept card, click "generate video" to trigger the agentic video generation process.
monitor real-time progress in the video panel with live logs and progress indicators.
once complete, watch or download the generated video.

api rate limits

uploads: 5 per hour per ip
video generation: 10 per hour per ip
general api: 100 requests per hour per ip
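
to make the "n per hour per ip" semantics concrete, here is a standard-library sketch of a sliding-window limiter using the numbers above. the project itself uses slowapi, so this is not its implementation; the category names and the sliding-window strategy are assumptions made for illustration.

```python
import time
from collections import defaultdict, deque

# limits from the list above: (max requests, window in seconds)
# the category names are illustrative, not taken from the codebase
LIMITS = {
    "upload": (5, 3600),
    "video": (10, 3600),
    "general": (100, 3600),
}


class SlidingWindowLimiter:
    def __init__(self, limits=LIMITS, clock=time.monotonic):
        self.limits = limits
        self.clock = clock
        self.hits = defaultdict(deque)  # (category, ip) -> request timestamps

    def allow(self, category: str, ip: str) -> bool:
        """Record the request and return True if it is within the limit."""
        max_requests, window = self.limits[category]
        now = self.clock()
        timestamps = self.hits[(category, ip)]
        # drop timestamps that have aged out of the window
        while timestamps and now - timestamps[0] >= window:
            timestamps.popleft()
        if len(timestamps) >= max_requests:
            return False
        timestamps.append(now)
        return True
```

each ip address has its own counter per category, so one client exhausting its upload quota does not affect another client or its own general api quota.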

project architecture

the application is composed of three main parts:

frontend: a next.js application that provides the user interface for uploading papers, viewing concepts, and watching the generated videos. features webgl shader background animations and real-time websocket updates.
backend: a fastapi server that handles file uploads, orchestrates the analysis and video generation process, and serves the final videos. includes rate limiting, api key authentication, and cors protection.
agent: integrated into the backend via async execution. uses langchain and gemini to generate manim scripts, renders them in parallel (batches of 3), and uploads to vercel blob for persistent storage.
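
the batch-of-three rendering mentioned for the agent could look roughly like the following. this is a sketch under assumptions: `render_scene` is a hypothetical coroutine standing in for the real manim render call, and failures are mapped to `None` so that one bad scene does not cancel the rest of its batch.

```python
import asyncio
from typing import Awaitable, Callable, List, Optional

BATCH_SIZE = 3  # batch size stated in this readme


async def render_in_batches(
    scenes: List[str],
    render_scene: Callable[[str], Awaitable[str]],  # hypothetical render coroutine
    batch_size: int = BATCH_SIZE,
) -> List[Optional[str]]:
    """Render scenes batch by batch; a failed scene yields None."""
    results: List[Optional[str]] = []
    for start in range(0, len(scenes), batch_size):
        batch = scenes[start:start + batch_size]
        # scenes within a batch run concurrently; batches run one after another
        outcomes = await asyncio.gather(
            *(render_scene(s) for s in batch), return_exceptions=True
        )
        results.extend(None if isinstance(o, BaseException) else o for o in outcomes)
    return results
```

results come back in scene order, which keeps the later stitching step simple: filter out the `None` entries and concatenate the rest.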

the backend and agent communicate via async subprocess pipelines, with logs and results streamed back to the frontend over websocket connections.
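
one way to stream subprocess output line by line with asyncio is sketched below. it illustrates the general pattern only: `stream_command` and `on_line` are hypothetical names, and in the real backend the handler would presumably forward each line to a websocket rather than print it.

```python
import asyncio
import sys
from typing import Awaitable, Callable, List


async def stream_command(
    cmd: List[str],
    on_line: Callable[[str], Awaitable[None]],  # hypothetical per-line handler
) -> int:
    """Run cmd, forward each output line to on_line, return the exit code."""
    proc = await asyncio.create_subprocess_exec(
        *cmd,
        stdout=asyncio.subprocess.PIPE,
        stderr=asyncio.subprocess.STDOUT,  # merge stderr into the same stream
    )
    assert proc.stdout is not None
    async for raw in proc.stdout:
        await on_line(raw.decode(errors="replace").rstrip())
    return await proc.wait()


if __name__ == "__main__":
    async def print_line(line: str) -> None:
        print(line)  # a real handler might send the line over a websocket

    asyncio.run(stream_command([sys.executable, "-c", "print('hello')"], print_line))
```

because lines are forwarded as they arrive, the frontend can show progress while a long render is still running instead of waiting for the process to exit.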

docker deployment

the backend includes a dockerfile for containerized deployment:

# build
docker build -t clarifai-backend ./backend

# run
docker run -p 8000:8000 \
  -e GEMINI_API_KEY=your_key \
  -e API_KEY=your_api_key \
  -e ALLOWED_ORIGINS=https://your-frontend.vercel.app \
  -e BLOB_READ_WRITE_TOKEN=your_blob_token \
  clarifai-backend

troubleshooting

videos return 404 in production

ensure BLOB_READ_WRITE_TOKEN is set in railway
check railway logs for upload errors
verify vercel blob store is created and accessible
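
a missing or blank variable is easy to overlook in a dashboard. the sketch below lists which of the backend variables named in this readme are unset; it is an illustrative helper, not part of the codebase, and it treats whitespace-only values as missing.

```python
import os

# backend variables listed in the railway section of this readme
REQUIRED_VARS = ["API_KEY", "GEMINI_API_KEY", "ALLOWED_ORIGINS", "BLOB_READ_WRITE_TOKEN"]


def missing_env_vars(env=None, required=REQUIRED_VARS):
    """Return the names of required variables that are unset or blank."""
    env = os.environ if env is None else env
    return [name for name in required if not env.get(name, "").strip()]


if __name__ == "__main__":
    for name in missing_env_vars():
        print(f"not set: {name}")
```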

cors errors

ensure ALLOWED_ORIGINS in railway includes your vercel url
check that frontend NEXT_PUBLIC_API_URL matches railway backend url
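
origin matching is exact, so small differences such as `http` versus `https` or a trailing slash are common causes of cors errors. the sketch below assumes the backend splits ALLOWED_ORIGINS on commas and compares each entry verbatim against the browser's origin header; how the backend actually parses the variable is an assumption here.

```python
from typing import List


def parse_allowed_origins(raw: str) -> List[str]:
    """Split a comma-separated ALLOWED_ORIGINS value, trimming whitespace."""
    return [origin.strip() for origin in raw.split(",") if origin.strip()]


def origin_allowed(origin: str, raw_allowed: str) -> bool:
    """True when origin matches a configured entry exactly (scheme, host, port)."""
    return origin in parse_allowed_origins(raw_allowed)


if __name__ == "__main__":
    configured = "http://localhost:3000,http://localhost:8000"
    print(origin_allowed("http://localhost:3000", configured))
```

browsers send the origin without a trailing slash, so an entry like `https://your-app.vercel.app/` would never match under this comparison.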

video generation fails

check that GEMINI_API_KEY is valid
ensure ffmpeg and latex are installed (handled by dockerfile in production)
verify that railway has enough memory allocated (2 GB or more recommended)

contributing

contributions are welcome! please feel free to submit a pull request.

license

mit license - see license file for details.
