AgentHelm

Production-Ready Orchestration for AI Agents

AgentHelm is a lightweight Python framework for building AI agents with a focus on production-readiness. Built on DSPy, it provides plan-driven execution, full observability, and transactional safety guarantees.

✨ Key Features

🧠 DSPy-Native Agents

import dspy
from agenthelm import ToolAgent, tool

@tool()
def search(query: str) -> str:
    """Search the web."""
    return f"Results for: {query}"


lm = dspy.LM("mistral/mistral-large-latest")
agent = ToolAgent(name="researcher", lm=lm, tools=[search])
result = agent.run("Find AI news")

📋 Plan-Driven Execution

# Generate a multi-step plan
agenthelm plan "Build a web scraper" -o plan.yaml

# Review and execute
agenthelm execute plan.yaml

🔍 Full Observability

# View execution traces
agenthelm traces list

# Filter and export
agenthelm traces filter --status failed
agenthelm traces export -o report.md -f md

🔗 MCP Protocol Support

# Connect to MCP servers
agenthelm mcp list-tools uvx mcp-server-time
agenthelm mcp run uvx mcp-server-time -t "What time is it?"

🛡️ Transactional Safety

@tool(compensating_tool="reverse_payment")
def process_payment(amount: float) -> dict:
    """Process payment - rolls back on failure."""
    return payment_api.charge(amount)


@tool(requires_approval=True)
def delete_data(id: str) -> bool:
    """Dangerous operation - requires approval."""
    return db.delete(id)

🚀 Quick Start

Installation

pip install agenthelm

CLI

# Initialize configuration
agenthelm init

# Run a simple task
agenthelm run "What is 2+2?"

# Interactive chat
agenthelm chat

# Generate and execute plans
agenthelm plan "Build a calculator" -o plan.yaml
agenthelm execute plan.yaml

Python SDK

import dspy
from agenthelm import ToolAgent, tool, ExecutionTracer
from agenthelm.core.storage import SqliteStorage

@tool()
def add(a: int, b: int) -> int:
    """Add two numbers."""
    return a + b


# Create agent with tracing
lm = dspy.LM("mistral/mistral-large-latest")
tracer = ExecutionTracer(storage=SqliteStorage("traces.db"))

agent = ToolAgent(
    name="calculator",
    lm=lm,
    tools=[add],
    tracer=tracer,
)

result = agent.run("What is 15 + 27?")
print(result.answer)  # "42"
print(f"Cost: ${result.total_cost_usd:.4f}")

📦 What's Included

Component	Description
`ToolAgent`	ReAct-style agent with tool execution
`PlannerAgent`	LLM-backed multi-step planning
`Orchestrator`	DAG execution with parallel steps
`ExecutionTracer`	Full observability with storage backends
`MCPToolAdapter`	Model Context Protocol integration
`MemoryHub`	Short-term and semantic memory
`@tool` decorator	Approval, retries, compensation, timeout
CLI	`run`, `plan`, `execute`, `chat`, `traces`, `mcp`
OpenTelemetry/Jaeger	Distributed tracing support

🔧 CLI Commands

Command	Description
`agenthelm run "task"`	Execute a task with ToolAgent
`agenthelm plan "task"`	Generate an execution plan
`agenthelm execute file`	Run a plan from YAML
`agenthelm chat`	Interactive REPL mode
`agenthelm traces list`	View execution traces
`agenthelm traces export`	Export to JSON/CSV/Markdown
`agenthelm mcp list-tools`	List MCP server tools
`agenthelm config set`	Configure settings

📊 Observability

Trace Storage

Traces are automatically saved to ~/.agenthelm/traces.db:

# Custom storage path
agenthelm run "task" -s ./my_traces.db

# View traces
agenthelm traces list
agenthelm traces show 0

# Filter and export
agenthelm traces filter --tool search --status success
agenthelm traces export -o report.md -f md

OpenTelemetry / Jaeger

# Start Jaeger
docker run -d -p 16686:16686 -p 4317:4317 jaegertracing/all-in-one

# Run with tracing
agenthelm run "task" --trace

# View in Jaeger UI: http://localhost:16686

🏗️ Architecture

graph TD
    A[Task] --> B[PlannerAgent]
    B --> C[Plan with Steps]
    C --> D[Orchestrator]
    D --> E[ToolAgent 1]
    D --> F[ToolAgent 2]
    E --> G[Tool Execution]
    F --> G
    G --> H[ExecutionTracer]
    H --> I[Storage Backend]

Key Components

PlannerAgent - Generates multi-step plans from natural language
Orchestrator - Executes plans with parallel steps and Saga rollback
ToolAgent - ReAct reasoning loop with tool execution
ExecutionTracer - Captures every decision for observability
MCPClient - Connects to external MCP servers

📚 Documentation

📖 Full Documentation: https://hadywalied.github.io/agenthelm/

🗺️ Roadmap

✅ v0.3.0 (Current)

DSPy-native agents (ToolAgent, PlannerAgent)
Plan-driven execution with YAML export
Multi-agent orchestration with Saga pattern
MCP protocol integration
Full CLI with traces, chat, execute
OpenTelemetry/Jaeger support
Cost and token tracking

🔜 v0.4.0 (Planned)

Budget enforcement (tokens, time, I/O)
Checkpointing for resumable workflows
Policy engine for constraints
Web dashboard for trace visualization

🤝 Contributing

We welcome contributions! See CONTRIBUTING.md for guidelines.

🐛 Bug reports
💡 Feature requests
📖 Documentation improvements
🔧 Code contributions

📄 License

MIT License - see LICENSE file for details.

🔗 Links

Documentation: hadywalied.github.io/agenthelm
GitHub: github.com/hadywalied/agenthelm
PyPI: pypi.org/project/agenthelm
Author: Hady Walied | LinkedIn

⭐ Star us on GitHub if AgentHelm helps you build production-ready agents!

Name		Name	Last commit message	Last commit date
Latest commit History 91 Commits
.devcontainer		.devcontainer
.github/workflows		.github/workflows
agenthelm		agenthelm
docs		docs
examples		examples
tests		tests
.gitignore		.gitignore
.python-version		.python-version
CHANGELOG.md		CHANGELOG.md
CHECKLIST.md		CHECKLIST.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
main.py		main.py
mkdocs.yml		mkdocs.yml
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

AgentHelm

✨ Key Features

🧠 DSPy-Native Agents

📋 Plan-Driven Execution

🔍 Full Observability

🔗 MCP Protocol Support

🛡️ Transactional Safety

🚀 Quick Start

Installation

CLI

Python SDK

📦 What's Included

🔧 CLI Commands

📊 Observability

Trace Storage

OpenTelemetry / Jaeger

🏗️ Architecture

Key Components

📚 Documentation

🗺️ Roadmap

✅ v0.3.0 (Current)

🔜 v0.4.0 (Planned)

🤝 Contributing

📄 License

🔗 Links

About

Uh oh!

Releases 2

Packages

Contributors 2

Uh oh!

Languages

License

hadywalied/agenthelm

Folders and files

Latest commit

History

Repository files navigation

AgentHelm

✨ Key Features

🧠 DSPy-Native Agents

📋 Plan-Driven Execution

🔍 Full Observability

🔗 MCP Protocol Support

🛡️ Transactional Safety

🚀 Quick Start

Installation

CLI

Python SDK

📦 What's Included

🔧 CLI Commands

📊 Observability

Trace Storage

OpenTelemetry / Jaeger

🏗️ Architecture

Key Components

📚 Documentation

🗺️ Roadmap

✅ v0.3.0 (Current)

🔜 v0.4.0 (Planned)

🤝 Contributing

📄 License

🔗 Links

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases 2

Packages 0

Contributors 2

Uh oh!

Languages

Packages