Creator: Stephen Zhu
See the Devpost submission for the project's inspiration and a description of what it does.
This project uses:
- Phase A ingestion on Modal + Actian VectorAI storage
- Phase B reasoning with OpenAI
- FastAPI backend + React frontend
Follow these steps once on a new machine.
```bash
conda create -n modal python=3.11 -y
conda activate modal
python --version
```

If Node.js is not installed on your machine, install it in the same env:
```bash
conda install -c conda-forge nodejs -y
node --version
npm --version
```

From the repo root:

```bash
pip install --upgrade pip
pip install -e ".[phase_a,phase_b,api,dev]"
```

Install frontend dependencies:
```bash
cd frontend
npm install
cd ..
```

If you are running Phase A ingestion, install the Actian Cortex Python client wheel:

```bash
pip install /path/to/actiancortex-0.1.0b1-py3-none-any.whl
```

If you do not have the wheel file, ask the organizers and see backend/docs/ACTIAN_SETUP.md.
Authenticate Modal once:

```bash
modal setup
```

Deploy the three Phase A services (from the repo root):

```bash
modal deploy backend/modal/transcription_service.py
modal deploy backend/modal/vision_extraction_service.py
modal deploy backend/modal/embedding_service.py
```

The backend expects these deployed Modal app/function names:

- phase-a-transcription/transcribe_media
- phase-a-vision-extraction/extract_document_vision
- phase-a-embedding/embed_chunks
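A quick way to confirm the deployments match the expected names is to try resolving each one with the Modal client. This is a hypothetical check script, not part of the repo; the (app, function) pairs are taken from the list above, and `modal.Function.from_name` is the Modal client's lookup API:

```python
# (app name, function name) pairs the backend expects, per this README.
EXPECTED_FUNCTIONS = [
    ("phase-a-transcription", "transcribe_media"),
    ("phase-a-vision-extraction", "extract_document_vision"),
    ("phase-a-embedding", "embed_chunks"),
]

def check_deployments(pairs=EXPECTED_FUNCTIONS):
    """Return the (app, function) pairs that fail to resolve."""
    import modal  # imported lazily; requires `modal setup` to have been run

    missing = []
    for app_name, fn_name in pairs:
        try:
            # Binds to the deployed function and forces resolution;
            # raises if the app or function does not exist.
            modal.Function.from_name(app_name, fn_name).hydrate()
        except Exception:
            missing.append((app_name, fn_name))
    return missing
```

Run it after the three `modal deploy` commands; an empty result means all three services resolved.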
Create a `.env` file at the repo root (auto-loaded by the run scripts):

```
# Required
OPENAI_API_KEY=your_openai_api_key_here

# Recommended defaults
OPENAI_REASONING_MODEL=gpt-4.1-mini
OPENAI_REASONING_TEMPERATURE=0.0
OPENAI_REASONING_MAX_OUTPUT_TOKENS=2500
TOC_PROMPT_VERSION=2026-02-28.v2
SECTION_CONCEPT_PROMPT_VERSION=2026-03-01.v3
EDGE_VALIDATION_PROMPT_VERSION=2026-03-01.v3

# Actian VectorAI (defaults shown)
ACTIAN_VECTORAI_ADDR=localhost:50051
ACTIAN_COLLECTION_PREFIX=course_chunks
ACTIAN_DISTANCE_METRIC=COSINE
ACTIAN_HNSW_M=16
ACTIAN_HNSW_EF_CONSTRUCT=200
ACTIAN_HNSW_EF_SEARCH=50
```

API keys/accounts you need:

- OpenAI API key (`OPENAI_API_KEY`) from the OpenAI platform
- Modal account auth (set by running `modal setup`)
No extra API key is required for local Actian VectorAI by default.
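The run scripts load `.env` automatically, so you normally never parse it yourself. For illustration, here is a minimal sketch of what such a loader does; the parser and the required-key list are assumptions for this example, not the project's actual loader:

```python
import os

# Per the "Required" block above; the other keys have defaults.
REQUIRED_KEYS = {"OPENAI_API_KEY"}

def parse_env(text):
    """Parse simple KEY=VALUE lines, skipping blanks and # comments."""
    env = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        env[key.strip()] = value.strip()
    return env

def load_env(path=".env"):
    """Load .env into os.environ without overriding existing values."""
    if not os.path.exists(path):
        return {}
    with open(path) as f:
        env = parse_env(f.read())
    missing = REQUIRED_KEYS - env.keys()
    if missing:
        raise RuntimeError(f".env missing required keys: {sorted(missing)}")
    for key, value in env.items():
        os.environ.setdefault(key, value)
    return env
```

Note the `setdefault`: values already exported in the shell win over `.env`, which is the usual dotenv convention.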
Use the same `modal` conda env in both terminals.

Terminal 1 (backend):

```bash
cd /Users/steph/Desktop/FlowMatchingLearning
conda activate modal
python backend/tools/run_api.py --host 127.0.0.1 --port 8000 --reload
```

Terminal 2 (frontend):

```bash
cd /Users/steph/Desktop/FlowMatchingLearning
conda activate modal
cd frontend
npm run dev
```

Then open:

- Frontend: http://127.0.0.1:5173
- Backend docs: http://127.0.0.1:8000/docs
Frontend requests to /api/* are proxied to the backend on port 8000.
Run preflight checks after setup:

```bash
python backend/tools/preflight.py --phase all --skip-actian
```

If your Actian service is already running and reachable:

```bash
python backend/tools/preflight.py --phase all
```

API endpoints:

- POST /api/v1/upload
- POST /api/v1/jobs/start
- POST /api/v1/jobs/start-combined
- GET /api/v1/jobs/{job_id}
- GET /api/v1/jobs/{job_id}/graph
- POST /api/v1/jobs/{job_id}/export
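The endpoints above compose into a simple upload → start → poll flow. A hedged client sketch using only the routes listed here; the base URL matches the local backend, but response field names such as `status` are assumptions, since the response schemas are not documented in this README:

```python
import json
import urllib.request

BASE = "http://127.0.0.1:8000"

def job_url(job_id, suffix=""):
    """Build a jobs endpoint URL from the routes listed above."""
    return f"{BASE}/api/v1/jobs/{job_id}{suffix}"

def get_json(url):
    """GET a URL and decode the JSON body (backend must be running)."""
    with urllib.request.urlopen(url) as resp:
        return json.load(resp)

def poll_job(job_id):
    """One status poll; the 'status' field name is an assumption."""
    return get_json(job_url(job_id)).get("status")

def get_graph(job_id):
    """Fetch the concept graph for a finished job."""
    return get_json(job_url(job_id, "/graph"))
```

The interactive docs at http://127.0.0.1:8000/docs show the real request/response shapes for each route.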
Hyperparameters file: `backend/config/hyperparameters.json`
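The schema of that file is not documented here, so a reader sketch can only return whatever JSON it finds; this helper (illustrative, not part of the repo) adds an empty-dict fallback for a missing file:

```python
import json
from pathlib import Path

def load_hyperparameters(path="backend/config/hyperparameters.json"):
    """Return the hyperparameters dict, or {} if the file is absent."""
    p = Path(path)
    if not p.exists():
        return {}
    return json.loads(p.read_text())
```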