GitHub - v1shay/ml-labs: Autonomous end-to-end machine learning lab powered by coordinated agent swarms; LUMA Hackathon Finalist

An autonomous multi-agent system executing the full machine learning research lifecycle.

Results

Finalist at both the LUMA Hackathon (top 3% of 397 teams) and the Open Source for AI Hackathon
Agents: 19 specialized agents
Research Phases: 8 end-to-end stages
Pipeline Coverage: Full lifecycle (data → deployment)
Execution Mode: Autonomous, multi-agent orchestration
Output: Validated models + structured research artifacts + APIs

Overview

ML-Labs is a fully autonomous machine learning research system that replaces fragmented workflows with a unified, agent-driven architecture.

It was built for the Luma AI Agents Hackathon, with a demo focused on training an interactive 3D graph neural network on competition datasets.

Research Phases

Phase	Description
1. Initiation & Planning	Defines objectives, constraints, and system scope
2. Data Foundation	Ingests, structures, and validates datasets
3. Model Development	Selects architectures and builds candidate models
4. Validation & Testing	Runs automated tests and robustness checks
5. Evaluation	Benchmarks performance and analyzes results
6. Research Output	Produces structured reports and documentation
7. Execution Layer	Builds systems, APIs, and deployment pipelines
8. Continuous Loop	Iterates through feedback and refinement cycles
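The eight phases above read as an ordered sequence that can repeat. A minimal sketch of that control flow follows, assuming each phase is a function that takes and returns a shared context dictionary. The phase names come from the table; `run_phases`, `make_logging_handler`, the `done` flag, and `max_cycles` are hypothetical names used only for illustration, not the project's code.

```python
from typing import Any, Callable, Dict, List

# Phase names copied from the table above; everything else is hypothetical.
PHASES: List[str] = [
    "Initiation & Planning",
    "Data Foundation",
    "Model Development",
    "Validation & Testing",
    "Evaluation",
    "Research Output",
    "Execution Layer",
    "Continuous Loop",
]

Context = Dict[str, Any]
Handler = Callable[[Context], Context]


def run_phases(handlers: Dict[str, Handler], context: Context, max_cycles: int = 1) -> Context:
    """Run every phase in order, repeating the whole sequence up to max_cycles times."""
    for cycle in range(max_cycles):
        context["cycle"] = cycle + 1
        for name in PHASES:
            handler = handlers.get(name)
            if handler is None:
                continue  # phases without a handler are skipped in this sketch
            context = handler(context)
        if context.get("done"):
            break  # a handler can signal that no further refinement is needed
    return context


def make_logging_handler(name: str) -> Handler:
    """Build a placeholder handler that only records that its phase ran."""
    def handler(context: Context) -> Context:
        context.setdefault("log", []).append(name)
        return context
    return handler
```

Passing `max_cycles=2` with one logging handler per phase records all eight phase names twice, which is the simplest stand-in for the feedback cycle described in phase 8.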

Agent System

Agent	Role
Control Agent	Orchestrates workflow and resource allocation
Problem Framing Agent	Defines objectives, scope, and success metrics
Data Analysis Agent	Explores datasets and identifies patterns
Schema Agent	Designs data structures and transformations
Quality Agent	Ensures data consistency and reliability
Readiness Agent	Validates data for modeling
Model Selection Agent	Chooses algorithms and architectures
Model Agents (Ensemble)	Builds and iterates candidate models
Optimization Agent	Performs tuning and search
Testing Agent (Integrated)	Runs automated validation checks
Field Testing Agent	Evaluates models in real-world scenarios
Evaluation Agent	Measures performance vs objectives
Statistical Agent	Performs statistical analysis and inference
Full Stack Agent	Designs APIs and infrastructure
SWE Agent	Implements backend systems
Senior SWE Agent	Reviews code for scalability and reliability
System Testing Agent	Executes end-to-end validation
Computing Agent	Manages environments and execution
Research Agent	Generates structured outputs and reports
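The table describes a Control Agent that orchestrates the other agents. One way to picture that relationship is a registry plus an ordered plan, sketched below. The agent names are taken from the table, but the `Agent` and `ControlAgent` classes, the `act` callable, and the shared-state dictionary are assumptions made for this example and may not match how the repository wires agents together.

```python
from dataclasses import dataclass, field
from typing import Any, Callable, Dict, List

State = Dict[str, Any]


@dataclass
class Agent:
    """A named unit of work; `act` is a hypothetical callable, not the project's API."""
    name: str
    act: Callable[[State], State]


@dataclass
class ControlAgent:
    """Dispatches registered agents in plan order and merges their outputs into shared state."""
    registry: Dict[str, Agent] = field(default_factory=dict)

    def register(self, agent: Agent) -> None:
        self.registry[agent.name] = agent

    def run(self, plan: List[str], state: State) -> State:
        for name in plan:
            if name not in self.registry:
                raise KeyError(f"no agent registered under {name!r}")
            updates = self.registry[name].act(state)
            state = {**state, **updates}  # later agents see earlier agents' results
        return state


control = ControlAgent()
control.register(Agent("Problem Framing Agent", lambda s: {"metric": "accuracy"}))
control.register(
    Agent("Model Selection Agent", lambda s: {"model": "baseline", "target": s["metric"]})
)
final_state = control.run(["Problem Framing Agent", "Model Selection Agent"], {})
```

Here the Model Selection Agent reads the metric chosen by the Problem Framing Agent, which illustrates the hand-off between roles without implying anything about the real agents' internals.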

Data

Sources: CSV, Kaggle datasets
Type: structured and semi-structured tabular data
Flow: ingestion → validation → feature construction → modeling
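The flow above can be sketched with the standard library alone. This example covers ingestion, validation, and feature construction for a small in-memory CSV and stops before modeling. The column names, the sample data, and the function names are all hypothetical; the project itself lists pandas and scikit-learn as dependencies and may implement these steps quite differently.

```python
import csv
import io
from typing import Dict, List


def ingest(csv_text: str) -> List[Dict[str, str]]:
    """Parse CSV text into a list of row dictionaries keyed by header."""
    return list(csv.DictReader(io.StringIO(csv_text)))


def validate(rows: List[Dict[str, str]], required: List[str]) -> List[Dict[str, str]]:
    """Keep only rows where every required column is present and non-empty."""
    return [r for r in rows if all((r.get(c) or "").strip() for c in required)]


def build_features(rows: List[Dict[str, str]], numeric: List[str]) -> List[List[float]]:
    """Convert the chosen columns to floats, producing one feature vector per row."""
    return [[float(r[c]) for c in numeric] for r in rows]


# Hypothetical sample: the second data row is missing its weight and is dropped.
SAMPLE = "height,weight,label\n1.70,65,a\n1.80,,b\n1.60,55,a\n"
rows = validate(ingest(SAMPLE), ["height", "weight", "label"])
features = build_features(rows, ["height", "weight"])
```

Dropping incomplete rows is only one possible validation policy; imputing missing values would be an equally reasonable choice depending on the dataset.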

Preprocessing:

schema construction
normalization
anomaly detection
feature engineering
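Two of the preprocessing steps listed above, normalization and anomaly detection, can be illustrated with a z-score approach. This is a standard-library sketch under stated assumptions: the README does not say which normalization or anomaly-detection method the project uses, and the threshold of 2.0, the sample readings, and the function names are hypothetical.

```python
from statistics import mean, pstdev
from typing import List


def normalize(values: List[float]) -> List[float]:
    """Z-score normalization: subtract the mean, divide by the population standard deviation."""
    mu = mean(values)
    sigma = pstdev(values)
    if sigma == 0:
        return [0.0 for _ in values]  # constant column: every value sits at the mean
    return [(v - mu) / sigma for v in values]


def flag_anomalies(values: List[float], threshold: float = 2.0) -> List[int]:
    """Return the indices whose absolute z-score exceeds the threshold."""
    return [i for i, z in enumerate(normalize(values)) if abs(z) > threshold]


readings = [10.0, 11.0, 9.0, 10.0, 12.0, 50.0]
outliers = flag_anomalies(readings)  # index 5 (the value 50.0) is flagged
```

A z-score threshold is sensitive to the very outliers it is trying to find, so on small or heavy-tailed columns a median-based rule may behave better.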

Experiments / Reproduction

Start the development server:

npm run dev

Run full pipeline:

curl -X POST http://localhost:3000/api/lab/run

Run inference:

curl -X POST http://localhost:3000/api/lab/predict

Input: raw dataset
Output: trained models + evaluation metrics + predictions
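For scripted runs, the two curl commands above can be approximated from Python with the standard library. The host, port, and paths are taken from those commands. The README does not document a request payload or a response format, so this sketch sends an empty body and returns the raw response text; `build_request` and `post` are hypothetical helper names.

```python
import urllib.request

BASE_URL = "http://localhost:3000"  # host and port taken from the curl commands above


def build_request(path: str) -> urllib.request.Request:
    """Build a POST request with an empty body, roughly mirroring `curl -X POST`."""
    return urllib.request.Request(BASE_URL + path, data=b"", method="POST")


def post(path: str, timeout: float = 30.0) -> str:
    """Send the request and return the raw response body as text."""
    with urllib.request.urlopen(build_request(path), timeout=timeout) as response:
        return response.read().decode("utf-8")
```

With the development server running, `post("/api/lab/run")` would trigger the full pipeline and `post("/api/lab/predict")` would request inference. A full run may take longer than the default timeout assumed here.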

Dependencies

Node.js
Python 3.x
scikit-learn
NumPy
pandas

API

GET  /api/lab/demo
POST /api/lab/run
POST /api/lab/predict

/demo → deterministic experiment snapshots
/run → full multi-agent execution
/predict → inference on trained models
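The endpoint list above pairs each path with one HTTP method. The lookup below makes that pairing explicit as a small sketch. The methods, paths, and descriptions are copied from the list, while the `resolve` function and its 404 and 405 responses are assumptions; how the actual handlers respond to an unexpected method or path is not documented here.

```python
from typing import Dict, Tuple

# (method, path) pairs and descriptions copied from the list above.
ROUTES: Dict[Tuple[str, str], str] = {
    ("GET", "/api/lab/demo"): "deterministic experiment snapshots",
    ("POST", "/api/lab/run"): "full multi-agent execution",
    ("POST", "/api/lab/predict"): "inference on trained models",
}


def resolve(method: str, path: str) -> Tuple[int, str]:
    """Return a hypothetical (HTTP status, message) pair for a request line."""
    key = (method.upper(), path)
    if key in ROUTES:
        return 200, ROUTES[key]
    if any(known_path == path for _, known_path in ROUTES):
        return 405, "method not allowed"  # path exists but under a different method
    return 404, "not found"
```

For example, `resolve("GET", "/api/lab/run")` yields a 405 in this sketch because `/api/lab/run` is listed only under POST.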

Repository Structure

ml-labs/
├── frontend/
├── backend/
├── agents/
├── pipelines/
├── data/
├── experiments/
└── README.md

Installation

npm install

python3 -m venv .venv
. .venv/bin/activate

pip install -r requirements.txt

Optional

npm run build
npm start
