Recursive Agent Improvement

Recursive Agent Improvement is a small research/project idea about extending AI agents without modifying the base LLM. Instead of trying to make one language model good at everything, the agent can identify recurring weaknesses, build specialized tools or small machine learning models for those capabilities, and expose them back to the agent through tool calls or skills.

Core idea

LLMs are strong planners, coders, explainers, and orchestrators. They are often weaker at narrow, measurable tasks such as perception, ranking, scoring, anomaly detection, extraction, and verification. Those gaps can be handled by specialist systems.

A recursive improvement loop looks like this:

Notice a recurring LLM weakness.
Define a narrow task with measurable success criteria.
Create or collect a small dataset.
Train a specialist model, evaluator, reranker, classifier, or other tool.
Wrap the specialist as an agent-accessible tool.
Add a skill describing when and how to use it.
Let the agent use the new capability in future work.

Examples

A screenshot evaluator that detects broken layouts, poor contrast, clipped text, or weak visual hierarchy.
A code-change risk classifier that flags patches likely to affect unrelated behavior.
A paper-search reranker tuned for AI safety relevance.
An audio detector for wakewords, events, or speech activity.
A fact-consistency checker for retrieved sources and generated summaries.
A style-matching model for generated desktop themes, UI mockups, or visual artifacts.

Architecture

Base LLM
  -> plans, writes code, explains, orchestrates

Tools
  -> execute deterministic actions, inspect files, search, run commands

Specialist ML tools
  -> perceive, classify, rank, score, detect, verify

Skills
  -> persistent procedural memory for when and how to use tools

The goal is not to make the LLM magically better by prompting. The goal is to build an expanding exocortex of narrow, trainable capabilities around the agent.

Templates

This repo includes a cloneable skeleton for building new PyTorch specialist training projects:

cp -a assets/pytorch-training-pipeline ~/Documents/my-specialist-tool
cd ~/Documents/my-specialist-tool

The template is intentionally implementation-free. It defines the structure, docs, configs, evaluation gates, artifact layout, and agent tool contract before task-specific PyTorch code is written.

Related skill

Use this with the Tier 1-2-3 Skill System:

https://github.com/H-Ali13381/tier-1-2-3-skill-system

Tier 1-2-3 decides whether a workflow should stay text-only, become script-backed, or escalate to an ML/specialist pipeline. Recursive Agent Improvement is the companion for the Tier 3 path: building the specialist tool, evaluator, verifier, reranker, classifier, detector, or training pipeline.

For agents

SKILL.md contains the agent-facing version of this model.

Install directly from GitHub:

npx skills add H-Ali13381/recursive-agent-improvement

The root SKILL.md is canonical. A complete mirror copy also lives under .agents/skills/recursive-agent-improvement/ so cross-client skill scanners can discover the skill with its bundled assets.

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.agents/skills/recursive-agent-improvement		.agents/skills/recursive-agent-improvement
assets/pytorch-training-pipeline		assets/pytorch-training-pipeline
.gitignore		.gitignore
AGENTS.md		AGENTS.md
LICENSE		LICENSE
README.md		README.md
SKILL.md		SKILL.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Recursive Agent Improvement

Core idea

Examples

Architecture

Templates

Related skill

For agents

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Recursive Agent Improvement

Core idea

Examples

Architecture

Templates

Related skill

For agents

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages