You Could've Invented Claude Code

In this post, I'll start from scratch and build up to Claude Code's architecture step by step. Along the way, you'll learn how powerful agents work under the hood.

Jan 09, 2026

This is cross-posted from the original on X. You can also view it in full markdown here.

While many people use Claude Code daily, few realize that the core of what makes it powerful is surprisingly simple: it’s a loop that lets an AI read files, run commands, and iterate until a task is done.

The complexity comes from handling edge cases, building a good UX, and integrating with real development workflows.

In this post, I’ll start from scratch and build up to Claude Code’s architecture step by step, showing how you could have invented it yourself from first principles, using nothing but a terminal, an LLM API, and the desire to make AI actually useful.

End Goal: Making AI actually useful

First, let’s establish the problem we’re trying to solve.

When you use ChatGPT or Claude in a browser, you’re doing a lot of manual labor:

Copy-paste code from the chat into files
Run commands yourself, then copy errors back
Provide context by uploading files or pasting content
Manually iterate through the fix-test-debug cycle

You’re essentially acting as the AI’s hands. The AI thinks; you execute.

What if the AI could execute too?

Imagine telling an AI: “Fix the bug in auth.py” and walking away. When you come back, the bug is fixed. The AI read the file, understood it, tried a fix, ran the tests, saw them fail, tried another approach, and eventually succeeded.

This is what an agent does. It’s an AI that can:

Take actions in the real world (read files, run commands)
Observe the results
Decide what to do next
Repeat until the task is complete

Let’s build one from scratch.

The Simplest Possible Agent

Let’s start with the absolute minimum: an AI that can run a single bash command.

#!/bin/bash
# agent-v0.sh - The simplest possible agent

PROMPT="$1"

# Build the request body with jq so quotes and newlines in the prompt are escaped
REQUEST_BODY=$(jq -n \
  --arg prompt "$PROMPT" \
  '{
    model: "claude-opus-4-5-20251101",
    max_tokens: 1024,
    messages: [{role: "user", content: ($prompt + "\n\nRespond with ONLY a bash command. No markdown, no explanation, no code blocks.")}]
  }')

# Ask Claude what command to run
RESPONSE=$(curl -s https://api.anthropic.com/v1/messages \
  -H "x-api-key: $ANTHROPIC_API_KEY" \
  -H "content-type: application/json" \
  -H "anthropic-version: 2023-06-01" \
  -d "$REQUEST_BODY")

# Extract the command from response
COMMAND=$(echo "$RESPONSE" | jq -r '.content[0].text')

echo "AI suggests: $COMMAND"
read -r -p "Run this command? (y/n) " CONFIRM

if [ "$CONFIRM" = "y" ]; then
  eval "$COMMAND"
fi

Usage

bash agent-v0.sh "list all Python files in this directory"
# AI suggests: ls *.py
# Run this command? (y/n)

This is... not very useful. The AI can suggest one command, then you’re back to doing everything manually.

But here’s the key insight: what if we put this in a loop?

Goal: Creating the agent loop

The fundamental insight behind all AI agents is the agent loop:

while (task not complete):
    1. AI decides what to do next
    2. Execute that action
    3. Show AI the result
    4. Go back to step 1

Let’s implement exactly this. The AI needs to tell us:

What action to take
Whether it’s done

We’ll use a simple JSON format:

#!/bin/bash
# agent-v1.sh - Agent with a loop

SYSTEM_PROMPT='You are a helpful assistant that can run bash commands.

When the user gives you a task, respond with JSON in this exact format:
{"action": "bash", "command": "your command here"}

When the task is complete, respond with:
{"action": "done", "message": "explanation of what was accomplished"}

Only respond with JSON. No other text.'

# We'll build messages as a JSON array (using jq for proper escaping)
MESSAGES="[]"

run_agent() {
    local USER_MSG="$1"
    
    # Add initial user message using jq to handle escaping
    MESSAGES=$(echo "$MESSAGES" | jq --arg msg "$USER_MSG" '. + [{"role": "user", "content": $msg}]')
    
    while true; do
        # Build the request body properly with jq
        REQUEST_BODY=$(jq -n \
            --arg model "claude-opus-4-5-20251101" \
            --arg system "$SYSTEM_PROMPT" \
            --argjson messages "$MESSAGES" \
            '{model: $model, max_tokens: 1024, system: $system, messages: $messages}')
        
        # Call the API
        RESPONSE=$(curl -s https://api.anthropic.com/v1/messages \
          -H "x-api-key: $ANTHROPIC_API_KEY" \
          -H "content-type: application/json" \
          -H "anthropic-version: 2023-06-01" \
          -d "$REQUEST_BODY")
        # Extract the text of the first content block
        AI_TEXT=$(echo "$RESPONSE" | jq -r '.content[0].text')
        
        # Add assistant message to history
        MESSAGES=$(echo "$MESSAGES" | jq --arg msg "$AI_TEXT" '. + [{"role": "assistant", "content": $msg}]')
        
        # Parse the action from the JSON response
        ACTION=$(echo "$AI_TEXT" | jq -r '.action // empty')
        
        if [ -z "$ACTION" ]; then
            echo "❌ Could not parse response: $AI_TEXT"
            break
        elif [ "$ACTION" = "done" ]; then
            echo "✅ $(echo "$AI_TEXT" | jq -r '.message')"
            break
        elif [ "$ACTION" = "bash" ]; then
            COMMAND=$(echo "$AI_TEXT" | jq -r '.command')
            echo "🔧 Running: $COMMAND"
            
            # Execute and capture output
            OUTPUT=$(eval "$COMMAND" 2>&1)
            echo "$OUTPUT"
            
            # Feed result back to AI
            MESSAGES=$(echo "$MESSAGES" | jq --arg msg "Command output: $OUTPUT" '. + [{"role": "user", "content": $msg}]')
        else
            echo "❌ Unknown action: $ACTION"
            break
        fi
    done
}

run_agent "$1"

Now we have something that can actually iterate:

bash agent-v1.sh "Create a file called hello.py that prints hello world, then run it"

# 🔧 Running: echo 'print("hello world")' > hello.py
# 🔧 Running: python hello.py
# hello world
# ✅ Created hello.py and executed it successfully. It prints "hello world".

The AI ran two commands and then told us it was done. We’ve created an agent loop!
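
If you want to poke at the control flow without an API key, here is a minimal Python sketch of the same loop. The `ask_model` parameter and the `scripted_model` helper are hypothetical stand-ins for the real API call: they just play back canned JSON replies in the format our system prompt asks for. The `max_steps` cap is an addition that the bash version doesn't have.

```python
import json
import subprocess

def run_agent(task, ask_model, max_steps=10):
    """Run the decide -> execute -> observe loop.

    ask_model is any callable that takes the message history and returns
    the model's reply as a JSON string (a stand-in for the API call).
    """
    messages = [{"role": "user", "content": task}]
    for _ in range(max_steps):
        reply = ask_model(messages)
        messages.append({"role": "assistant", "content": reply})
        action = json.loads(reply)
        if action["action"] == "done":
            return action["message"]
        # Execute the command and feed its output back as the next user turn
        result = subprocess.run(action["command"], shell=True,
                                capture_output=True, text=True)
        messages.append({"role": "user",
                         "content": f"Command output: {result.stdout}{result.stderr}"})
    return "Stopped: step limit reached"

def scripted_model(replies):
    """Return a fake model that plays back canned replies in order."""
    remaining = iter(replies)
    return lambda messages: next(remaining)

fake = scripted_model([
    '{"action": "bash", "command": "echo hello"}',
    '{"action": "done", "message": "Printed hello"}',
])
print(run_agent("say hello", fake))  # prints: Printed hello
```

Swapping `fake` for a function that calls the Messages API gives you the same behavior as agent-v1.sh, with a step limit so a confused model can't loop forever.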

But wait! We’re executing arbitrary commands with no safety checks. The AI could rm -rf / and we’d blindly execute it.

Goal: Adding permission controls

Let’s add a human-in-the-loop for dangerous operations. First, we define a function that wraps command execution with a safety check:

# Add this function BEFORE run_agent() in your script
execute_with_permission() {
    local COMMAND="$1"
    
    # Check if command seems dangerous
    if echo "$COMMAND" | grep -qE 'rm |sudo |chmod |curl.*\|.*sh'; then
        # Use >&2 to print to stderr, so prompts display immediately
        # (stdout gets captured by the $(...) in the agent loop)
        echo "⚠️  Potentially dangerous command: $COMMAND" >&2
        echo "Allow? (y/n)" >&2
        read -r CONFIRM
        if [ "$CONFIRM" != "y" ]; then
            echo "DENIED BY USER"
            return 1
        fi
    fi
    
    eval "$COMMAND" 2>&1
}

Then, inside the agent loop, we replace the direct eval call with our new function:

        # BEFORE:
        OUTPUT=$(eval "$COMMAND" 2>&1)
        
        # AFTER (with permission check):
        OUTPUT=$(execute_with_permission "$COMMAND")

That’s it! The function sits between the AI’s request and actual execution, giving you a chance to block dangerous commands. When you deny a command, the function prints DENIED BY USER to stdout, so the loop feeds that back to the AI as the command output and it can try a different approach.

Try it out:

# Create a test file
echo 'print("hello world")' > hello.py

# Ask the agent to delete it
bash agent-v1.sh "delete the file hello.py"

# 🔧 Running: rm hello.py
# ⚠️  Potentially dangerous command: rm hello.py
# Allow? (y/n)

Type y to allow the deletion, or n to block it.

This is the beginning of a permission system. Claude Code takes this much further with:

Tool-specific permissions (file edits vs. bash commands)
Pattern-based allowlists (Bash(npm test:*) allows any npm test command)
Session-level “accept all” modes for when you trust the AI

The key insight: the human should be able to control what the AI can do, but with enough granularity that it’s not annoying.
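
To make the allowlist idea concrete, here is a rough sketch of pattern-based auto-approval. It uses plain `fnmatch` globs, which is an assumption for illustration and not Claude Code's actual rule syntax or matching logic. The `ALLOW_PATTERNS` list and the `decide` function are hypothetical names.

```python
from fnmatch import fnmatchcase

# Hypothetical allowlist: fnmatch-style globs, not Claude Code's real rule syntax
ALLOW_PATTERNS = ["npm test", "npm test *", "git status", "ls *"]

# Characters that let one command smuggle in another
SHELL_METACHARS = (";", "&", "|", "`", "$(", ">", "<", "\n")

def decide(command, allow_patterns=ALLOW_PATTERNS):
    """Return 'allow' to run without prompting, or 'ask' to prompt the user."""
    command = command.strip()
    if any(ch in command for ch in SHELL_METACHARS):
        return "ask"  # never auto-approve chained or redirected commands
    if any(fnmatchcase(command, pattern) for pattern in allow_patterns):
        return "allow"
    return "ask"

print(decide("npm test -- --watch"))  # prints: allow
print(decide("npm test; rm -rf /"))   # prints: ask
print(decide("rm hello.py"))          # prints: ask
```

The metacharacter check matters: without it, a glob like `npm test *` would happily approve `npm test; rm -rf /`.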

Goal: Beyond bash - Adding tools

Running bash commands is powerful, but it’s also:

Dangerous: unlimited access to the system
Inefficient: reading a file shouldn’t spawn a subprocess
Imprecise: output parsing is fragile

What if we gave the AI structured tools instead?

We’ll switch to Python here since it handles JSON and API calls more cleanly:

# agent-v2.py - Agent with structured tools
import anthropic
import json
import os

client = anthropic.Anthropic()

TOOLS = [
    {
        "name": "read_file",
        "description": "Read the contents of a file",
        "input_schema": {
            "type": "object",
            "properties": {
                "path": {"type": "string", "description": "Path to the file"}
            },
            "required": ["path"]
        }
    },
    {
        "name": "write_file",
        "description": "Write content to a file",
        "input_schema": {
            "type": "object",
            "properties": {
                "path": {"type": "string", "description": "Path to the file"},
                "content": {"type": "string", "description": "Content to write"}
            },
            "required": ["path", "content"]
        }
    },
    {
        "name": "run_bash",
        "description": "Run a bash command",
        "input_schema": {
            "type": "object",
            "properties": {
                "command": {"type": "string", "description": "The command to run"}
            },
            "required": ["command"]
        }
    }
]

def execute_tool(name, input):
    """Execute a tool and return the result."""
    if name == "read_file":
        try:
            with open(input["path"], "r") as f:
                return f.read()
        except Exception as e:
            return f"Error: {e}"
    
    elif name == "write_file":
        try:
            with open(input["path"], "w") as f:
                f.write(input["content"])
            return f"Successfully wrote to {input['path']}"
        except Exception as e:
            return f"Error: {e}"
    
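    elif name == "run_bash":
        import subprocess
        result = subprocess.run(
            input["command"], 
            shell=True, 
            capture_output=True, 
            text=True
        )
        return result.stdout + result.stderr

    # Unknown tool: return an error string so the caller never receives None
    return f"Error: unknown tool '{name}'"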

def run_agent(task):
    """Main agent loop."""
    messages = [{"role": "user", "content": task}]
    
    while True:
        response = client.messages.create(
            model="claude-opus-4-5-20251101",
            max_tokens=4096,
            tools=TOOLS,
            messages=messages
        )
        
        # Check if we're done (any stop reason other than a tool call ends the loop,
        # so a max_tokens stop can't spin forever)
        if response.stop_reason != "tool_use":
            # Extract final text response
            for block in response.content:
                if hasattr(block, "text"):
                    print(f"✅ {block.text}")
            break
        
        # Process tool uses
        if response.stop_reason == "tool_use":
            # Add assistant's response to history
            messages.append({"role": "assistant", "content": response.content})
            
            tool_results = []
            for block in response.content:
                if block.type == "tool_use":
                    print(f"🔧 {block.name}: {json.dumps(block.input)}")
                    result = execute_tool(block.name, block.input)
                    print(f"   → {result[:200]}...")  # Truncate for display
                    
                    tool_results.append({
                        "type": "tool_result",
                        "tool_use_id": block.id,
                        "content": result
                    })
            
            # Add results to conversation
            messages.append({"role": "user", "content": tool_results})

if __name__ == "__main__":
    import sys
    run_agent(sys.argv[1])

Now we’re using Anthropic’s native tool use API. This is much better because:

Type safety: the AI knows exactly what parameters each tool accepts
Explicit actions: reading a file is a read_file call, not cat
Controlled surface area: we decide what tools exist
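
The schema guides the model, but it can still be worth checking the input before you act on it. Here is a small defensive sketch that checks a tool call against the `required` and `properties` fields of its schema. `validate_input` is a hypothetical helper, and it only handles string parameters, which is all our tools use.

```python
def validate_input(tool, tool_input):
    """Return a list of problems with tool_input; an empty list means it looks valid."""
    schema = tool["input_schema"]
    properties = schema.get("properties", {})
    problems = []
    for key in schema.get("required", []):
        if key not in tool_input:
            problems.append(f"missing required parameter: {key}")
    for key, value in tool_input.items():
        if key not in properties:
            problems.append(f"unexpected parameter: {key}")
        elif properties[key].get("type") == "string" and not isinstance(value, str):
            problems.append(f"parameter {key} must be a string")
    return problems

READ_FILE = {
    "name": "read_file",
    "input_schema": {
        "type": "object",
        "properties": {"path": {"type": "string"}},
        "required": ["path"],
    },
}

print(validate_input(READ_FILE, {"path": "main.py"}))  # prints: []
print(validate_input(READ_FILE, {"file": "main.py"}))
# prints: ['missing required parameter: path', 'unexpected parameter: file']
```

If the list is non-empty, you can return it as the tool result so the model can correct its own call.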

Try it out:

# Create a test file for the agent to work with
cat > main.py << 'EOF'
def calculate(x, y):
    return x + y

def greet(name):
    print(f"Hello, {name}!")
EOF

# Run the agent
uv run --with anthropic python agent-v2.py "Read main.py and add a docstring to the first function"

# 🔧 read_file: {"path": "main.py"}
#    → def calculate(x, y):...
# 🔧 write_file: {"path": "main.py", "content": "def calculate(x, y):\n    \"\"\"Calculate..."}
#    → Successfully wrote to main.py
# ✅ I've added a docstring to the calculate function explaining its purpose.

Goal: Making edits precise

Our write_file tool has a problem: it replaces the entire file. If the AI makes a small change to a 1000-line file, it has to output all 1000 lines. This is:

Expensive: more output tokens = more cost
Error-prone: the AI might accidentally drop lines
Slow: generating that much text takes time

What if we had a tool for surgical edits?

{
    "name": "edit_file",
    "description": "Make a precise edit to a file by replacing a unique string",
    "input_schema": {
        "type": "object",
        "properties": {
            "path": {"type": "string"},
            "old_str": {"type": "string", "description": "Exact string to find (must be unique in file)"},
            "new_str": {"type": "string", "description": "String to replace it with"}
        },
        "required": ["path", "old_str", "new_str"]
    }
}

Implementation:

def edit_file(path, old_str, new_str):
    with open(path, "r") as f:
        content = f.read()
    
    # Ensure the string is unique
    count = content.count(old_str)
    if count == 0:
        return f"Error: '{old_str}' not found in file"
    if count > 1:
        return f"Error: '{old_str}' found {count} times. Must be unique."
    
    new_content = content.replace(old_str, new_str)
    with open(path, "w") as f:
        f.write(new_content)
    
    return f"Successfully replaced text in {path}"

This is essentially how Claude Code’s str_replace tool works. The requirement for uniqueness might seem annoying, but it’s actually a feature:

Forces the AI to include enough context to be unambiguous
Creates a natural diff that’s easy for humans to review
Prevents accidental mass replacements
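
To see the "natural diff" point in practice, here is a sketch that previews a unique-string edit as a unified diff before anything is written to disk. `preview_edit` is a hypothetical helper built on Python's standard `difflib`. It works on a string instead of a file so it is easy to try.

```python
import difflib

def preview_edit(content, old_str, new_str, path="file"):
    """Return a unified diff for a unique-string replacement, or an error string."""
    count = content.count(old_str)
    if count != 1:
        return f"Error: expected exactly 1 match, found {count}"
    new_content = content.replace(old_str, new_str)
    diff = difflib.unified_diff(
        content.splitlines(keepends=True),
        new_content.splitlines(keepends=True),
        fromfile=f"a/{path}",
        tofile=f"b/{path}",
    )
    return "".join(diff)

source = "def calculate(x, y):\n    return x + y\n"
print(preview_edit(source, "return x + y", "return x * y", "main.py"))
```

Showing this diff in the permission prompt lets a human approve the exact change instead of a whole-file rewrite.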

Goal: Searching the Codebase

So far our agent can read files it knows about. But what about a task like “find where the authentication bug is”?

The AI needs to search the codebase. Let’s add tools for that.

SEARCH_TOOLS = [
    {
        "name": "glob",
        "description": "Find files matching a pattern",
        "input_schema": {
            "type": "object",
            "properties": {
                "pattern": {"type": "string", "description": "Glob pattern (e.g., '**/*.py')"}
            },
            "required": ["pattern"]
        }
    },
    {
        "name": "grep",
        "description": "Search for a pattern in files",
        "input_schema": {
            "type": "object",
            "properties": {
                "pattern": {"type": "string", "description": "Regex pattern to search for"},
                "path": {"type": "string", "description": "Directory or file to search in"}
            },
            "required": ["pattern"]
        }
    }
]

Now the AI can:

glob("**/*.py") → find all Python files
grep("def authenticate", "src/") → find authentication code
read_file("src/auth.py") → read the relevant file
edit_file(...) → fix the bug

This is the pattern: give the AI tools to explore, and it can navigate codebases it’s never seen before.
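
The schemas above only describe the tools, so here is one way the implementations might look. This is a sketch using only the standard library. `glob_files`, `grep_files`, and the `max_matches` cap are hypothetical names and choices, not how Claude Code implements its search tools.

```python
import re
import tempfile
from pathlib import Path

def glob_files(pattern, root="."):
    """Return sorted paths (relative to root) of files matching a glob pattern."""
    base = Path(root)
    return sorted(str(p.relative_to(base)) for p in base.glob(pattern) if p.is_file())

def grep_files(pattern, path=".", max_matches=50):
    """Return 'file:line: text' entries for lines matching a regex under path."""
    regex = re.compile(pattern)
    target = Path(path)
    files = [target] if target.is_file() else sorted(
        p for p in target.rglob("*") if p.is_file())
    matches = []
    for file in files:
        try:
            lines = file.read_text(encoding="utf-8").splitlines()
        except (UnicodeDecodeError, OSError):
            continue  # skip binary or unreadable files
        for number, line in enumerate(lines, start=1):
            if regex.search(line):
                matches.append(f"{file}:{number}: {line.strip()}")
                if len(matches) >= max_matches:
                    return matches  # cap output so results stay small
    return matches

# Try both tools on a throwaway project
with tempfile.TemporaryDirectory() as root:
    src = Path(root) / "src"
    src.mkdir()
    (src / "auth.py").write_text("def authenticate(user):\n    return True\n",
                                 encoding="utf-8")
    found = glob_files("**/*.py", root)
    hits = grep_files(r"def authenticate", root)
    print(found)
    print(hits)
```

Capping the number of matches is a small design choice that pays off later: search results go straight into the context window, so an unbounded grep can eat it quickly.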

Goal: Context management

Here’s a problem you’ll hit quickly: context windows are finite.

If you’re working on a large codebase, the conversation might look like:

User: “Fix the bug in authentication”
AI: reads 10 files, runs 20 commands, tries 3 approaches
...conversation is now 100,000 tokens
AI: runs out of context and starts forgetting earlier information

How do we handle this?

Option 1: summarization (compaction)

When context gets too long, summarize what happened:

def compact_conversation(messages):
    """Summarize the conversation to free up context."""
    summary_prompt = """Summarize this conversation concisely, preserving:
    - The original task
    - Key findings and decisions
    - Current state of the work
    - What still needs to be done"""
    
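    response = client.messages.create(
        model="claude-opus-4-5-20251101",
        max_tokens=2000,
        messages=[
            {"role": "user", "content": f"{messages}\n\n{summary_prompt}"}
        ]
    )
    # The API returns a Message object; the summary text is in its first content block
    summary = response.content[0].text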
    
    return [{"role": "user", "content": f"Previous work summary:\n{summary}"}]
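
When should compaction kick in? One simple approach is to estimate the size of the history and compact once it crosses a threshold. The sketch below assumes roughly four characters per token, which is a crude heuristic and not a real tokenizer. The `budget` and `threshold` values are made-up defaults, and `compact` is any function that takes the history and returns a shorter one.

```python
import json

def estimate_tokens(messages):
    """Rough token estimate: about 4 characters per token (a heuristic only)."""
    return len(json.dumps(messages, default=str)) // 4

def maybe_compact(messages, compact, budget=100_000, threshold=0.8):
    """Compact the history once the estimate reaches threshold * budget."""
    if estimate_tokens(messages) < budget * threshold:
        return messages  # still plenty of room, leave the history alone
    return compact(messages)

history = [{"role": "user", "content": "Fix the bug in authentication"}]
print(estimate_tokens(history))
print(maybe_compact(history, compact=lambda m: []) is history)  # prints: True
```

Calling `maybe_compact` at the top of each loop iteration keeps the check cheap and keeps the summarization call rare.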

Option 2: sub-agents (delegation)

For complex tasks, spawn a sub-agent with its own context:

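def delegate_to_subagent(task, tools_allowed):
    """Spawn a sub-agent for a focused task.

    Sketch only: assumes run_agent has been extended to accept tools and
    max_turns, and to return an object with a final_answer attribute.
    """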
    result = run_agent(
        task=task,
        tools=tools_allowed,
        max_turns=10  # Prevent infinite loops
    )
    # Only return the result, not the full conversation
    return result.final_answer

This is why Claude Code has the concept of subagents: specialized agents that handle focused tasks in their own context, returning just the results.
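
Here is a self-contained sketch of that idea that you can run without an API key. `call_model` and `execute_tool` are hypothetical callables standing in for the real API call and tool dispatcher, and the reply shape (a dict with `text` and an optional `tool_call`) is invented for this example.

```python
def run_subagent(task, call_model, execute_tool, max_turns=10):
    """Run a focused task in a fresh history and return only the final text.

    call_model(messages) -> dict with "text" and an optional "tool_call"
    execute_tool(name, args) -> str
    """
    messages = [{"role": "user", "content": task}]  # fresh context, not the parent's
    for _ in range(max_turns):
        reply = call_model(messages)
        messages.append({"role": "assistant", "content": reply["text"]})
        call = reply.get("tool_call")
        if call is None:
            return reply["text"]  # the parent only ever sees this string
        result = execute_tool(call["name"], call["args"])
        messages.append({"role": "user", "content": f"Tool result: {result}"})
    return "Sub-agent stopped: turn limit reached"

def fake_model(messages):
    # First turn: ask for a search. Second turn: answer using the tool result.
    if len(messages) == 1:
        return {"text": "Searching",
                "tool_call": {"name": "grep", "args": {"pattern": "def authenticate"}}}
    return {"text": "Auth logic lives in src/auth.py"}

def fake_tool(name, args):
    return "src/auth.py:1: def authenticate(user):"

answer = run_subagent("Find the auth code", fake_model, fake_tool)
print(answer)  # prints: Auth logic lives in src/auth.py
```

The intermediate search output never reaches the parent conversation. Only the one-line answer does, which is the whole point of delegating.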

Goal: the system prompt

We’ve been glossing over something important: how does the AI know how to behave?

The system prompt is where you encode:

The AI’s identity and capabilities
Guidelines for tool usage
Project-specific context
Behavioral rules

Here’s a simplified version of what makes Claude Code effective:

SYSTEM_PROMPT = """You are an AI assistant that helps with software development tasks.
You have access to the following tools:
- read_file: Read file contents
- write_file: Create or overwrite files
- edit_file: Make precise edits to existing files
- glob: Find files by pattern
- grep: Search for patterns in files
- bash: Run shell commands

## Guidelines

### Before making changes:
1. Understand the task fully before acting
2. Read relevant files to understand context
3. Plan your approach

### When editing code:
1. Use edit_file for small changes (preferred)
2. Use write_file only for new files or complete rewrites
3. Run tests after changes when possible
4. If tests fail, analyze the error and iterate

### General principles:
- Be concise but thorough
- Explain your reasoning briefly
- Ask for clarification if the task is ambiguous
- If you're stuck, say so instead of guessing

## Current Directory
You are working in: {current_directory}
"""

But here’s the problem: what if the project has specific conventions? What if the team uses a particular testing framework, or has a non-standard directory structure?

Goal: Project-Specific Context (CLAUDE.md)

Claude Code solves this with CLAUDE.md - a file at the project root that gets automatically included in context:

# CLAUDE.md

## Project Overview
This is a FastAPI application for user authentication.

## Key Commands
- `make test`: Run all tests
- `make lint`: Run linting
- `make dev`: Start development server

## Architecture
- `src/api/`: API routes
- `src/models/`: Database models
- `src/services/`: Business logic
- `tests/`: Test files (mirror src/ structure)

## Conventions
- All functions must have type hints
- Use pydantic for request/response models
- Write tests before implementing features (TDD)

## Known Issues
- The /auth/refresh endpoint has a race condition (see issue #142)

Now the AI knows:

How to run tests for this project
Where to find things
What conventions to follow
Known gotchas to watch out for

This is one of Claude Code’s most powerful features: project knowledge that travels with the code.
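
In our own agent, wiring this up can be as simple as appending the file to the system prompt at startup. The sketch below shows the idea. `build_system_prompt` and the section heading it adds are hypothetical, and Claude Code's real loading logic is more involved than this.

```python
import tempfile
from pathlib import Path

def build_system_prompt(base_prompt, project_root="."):
    """Append the project's CLAUDE.md to the base system prompt, if one exists."""
    claude_md = Path(project_root) / "CLAUDE.md"
    if not claude_md.is_file():
        return base_prompt  # no project notes, use the base prompt unchanged
    notes = claude_md.read_text(encoding="utf-8").strip()
    return f"{base_prompt}\n\n## Project Context (from CLAUDE.md)\n{notes}"

# Try it on a throwaway project directory
with tempfile.TemporaryDirectory() as root:
    without_file = build_system_prompt("You are a coding assistant.", root)
    (Path(root) / "CLAUDE.md").write_text("- `make test`: Run all tests\n",
                                          encoding="utf-8")
    with_file = build_system_prompt("You are a coding assistant.", root)
    print(with_file)
```

Because the file lives in the repository, everyone who clones the project gets the same agent behavior without any extra setup.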

Putting it all together

Let’s see what we’ve built. The core of an AI coding agent is this loop:

1. Setup (runs once)

Load the system prompt with tool descriptions, behavioral guidelines, and project context (CLAUDE.md)
Initialize an empty conversation history

2. Agent Loop (repeats until done)

Send conversation history to the LLM
LLM decides: use a tool or respond to user
If tool use:

Check permissions (prompt user if dangerous)
Execute the tool (read_file, edit_file, bash, glob, grep, etc.)
Add the result to conversation history
Loop back to step 2

If final answer:

Display response to user
Done

That’s it. AI coding agents, from our short bash script to Claude Code, follow this same basic pattern.

Now let’s build a complete, working mini-Claude Code that you can actually use. It combines everything we’ve learned: the agent loop, structured tools, permission checks, and an interactive REPL:

#!/usr/bin/env python3
# mini-claude-code.py - A minimal Claude Code clone

import anthropic
import subprocess
import os
import json

client = anthropic.Anthropic()

TOOLS = [
    {
        "name": "read_file",
        "description": "Read the contents of a file",
        "input_schema": {
            "type": "object",
            "properties": {
                "path": {"type": "string", "description": "Path to the file"}
            },
            "required": ["path"]
        }
    },
    {
        "name": "write_file",
        "description": "Write content to a file (creates or overwrites)",
        "input_schema": {
            "type": "object",
            "properties": {
                "path": {"type": "string", "description": "Path to the file"},
                "content": {"type": "string", "description": "Content to write"}
            },
            "required": ["path", "content"]
        }
    },
    {
        "name": "list_files",
        "description": "List files in a directory",
        "input_schema": {
            "type": "object",
            "properties": {
                "path": {"type": "string", "description": "Directory path (default: current directory)"}
            }
        }
    },
    {
        "name": "run_command",
        "description": "Run a shell command",
        "input_schema": {
            "type": "object",
            "properties": {
                "command": {"type": "string", "description": "The command to run"}
            },
            "required": ["command"]
        }
    }
]

DANGEROUS_PATTERNS = ["rm ", "sudo ", "chmod ", "mv ", "cp ", "> ", ">>"]

def check_permission(tool_name, tool_input):
    """Check if an action requires user permission."""
    if tool_name == "run_command":
        cmd = tool_input.get("command", "")
        if any(p in cmd for p in DANGEROUS_PATTERNS):
            print(f"\n⚠️  Potentially dangerous command: {cmd}")
            response = input("Allow? (y/n): ").strip().lower()
            return response == "y"
    elif tool_name == "write_file":
        path = tool_input.get("path", "")
        print(f"\n📝 Will write to: {path}")
        response = input("Allow? (y/n): ").strip().lower()
        return response == "y"
    return True

def execute_tool(name, tool_input):
    """Execute a tool and return the result."""
    if name == "read_file":
        path = tool_input["path"]
        try:
            with open(path, "r") as f:
                content = f.read()
            return f"Contents of {path}:\n{content}"
        except Exception as e:
            return f"Error reading file: {e}"

    elif name == "write_file":
        path = tool_input["path"]
        content = tool_input["content"]
        try:
            with open(path, "w") as f:
                f.write(content)
            return f"✅ Successfully wrote to {path}"
        except Exception as e:
            return f"Error writing file: {e}"

    elif name == "list_files":
        path = tool_input.get("path", ".")
        try:
            files = os.listdir(path)
            return f"Files in {path}:\n" + "\n".join(f"  {f}" for f in sorted(files))
        except Exception as e:
            return f"Error listing files: {e}"

    elif name == "run_command":
        cmd = tool_input["command"]
        try:
            result = subprocess.run(cmd, shell=True, capture_output=True, text=True, timeout=30)
            output = result.stdout + result.stderr
            return f"$ {cmd}\n{output}" if output else f"$ {cmd}\n(no output)"
        except subprocess.TimeoutExpired:
            return "Command timed out after 30 seconds"
        except Exception as e:
            return f"Error running command: {e}"

    return f"Unknown tool: {name}"

def agent_loop(user_message, conversation_history):
    """Run the agent loop until the task is complete."""
    conversation_history.append({"role": "user", "content": user_message})

    while True:
        # Call Claude
        response = client.messages.create(
            model="claude-opus-4-5-20251101",
            max_tokens=4096,
            system=f"You are a helpful coding assistant. Working directory: {os.getcwd()}",
            tools=TOOLS,
            messages=conversation_history
        )

        # Add assistant response to history
        conversation_history.append({"role": "assistant", "content": response.content})

        # Check if we're done (any stop reason other than a tool call, so a
        # max_tokens stop doesn't send an empty tool-result message)
        if response.stop_reason != "tool_use":
            # Print the final text response
            for block in response.content:
                if hasattr(block, "text"):
                    print(f"\n🤖 {block.text}")
            break

        # Process tool calls
        tool_results = []
        for block in response.content:
            if block.type == "tool_use":
                tool_name = block.name
                tool_input = block.input

                print(f"\n🔧 {tool_name}: {json.dumps(tool_input)}")

                # Check permissions
                if not check_permission(tool_name, tool_input):
                    result = "Permission denied by user"
                    print(f"   🚫 {result}")
                else:
                    result = execute_tool(tool_name, tool_input)
                    # Truncate long output for display
                    display = result[:200] + "..." if len(result) > 200 else result
                    print(f"   → {display}")

                tool_results.append({
                    "type": "tool_result",
                    "tool_use_id": block.id,
                    "content": result
                })

        # Add tool results to conversation
        conversation_history.append({"role": "user", "content": tool_results})

    return conversation_history

def main():
    print("Mini Claude Code")
    print(" Type your requests, or 'quit' to exit.\n")

    conversation_history = []

    while True:
        try:
            user_input = input("You: ").strip()
        except (EOFError, KeyboardInterrupt):
            print("\nGoodbye!")
            break

        if not user_input:
            continue
        if user_input.lower() in ["quit", "exit", "q"]:
            print("Goodbye!")
            break

        conversation_history = agent_loop(user_input, conversation_history)

if __name__ == "__main__":
    main()

Save this as mini-claude-code.py and run it:

uv run --with anthropic python mini-claude-code.py

Here’s what a session looks like:

Mini Claude Code
 Type your requests, or 'quit' to exit.

You: create a python file that prints the fibonacci sequence up to n

🔧 write_file: {"path": "fibonacci.py", "content": "def fibonacci(n):\n    ..."}

📝 Will write to: fibonacci.py
Allow? (y/n): y
   → ✅ Successfully wrote to fibonacci.py

🤖 I've created fibonacci.py with a function that prints the Fibonacci sequence.
   Would you like me to run it to test it?

You: yes, run it with n=10

🔧 run_command: {"command": "python fibonacci.py 10"}
   → $ python fibonacci.py 10
     0 1 1 2 3 5 8 13 21 34

🤖 The script works correctly! It printed the first 10 Fibonacci numbers.

You: quit
Goodbye!

That’s a working mini Claude Code clone in under 200 lines. It has:

Interactive REPL: keeps conversation context between prompts
Multiple tools: read, write, list files, run commands
Permission checks: asks before writing files or running dangerous commands
Conversation memory: each follow-up builds on previous context

This is essentially what Claude Code does, plus:

A polished terminal UI
Sophisticated permission system
Context compaction when conversations get long
Subagent delegation for complex tasks
Hooks for custom automation
Integration with git and other dev tools

The Claude Agent SDK

If you want to build on this foundation without reinventing the wheel, Anthropic offers the Claude Agent SDK. It’s the same engine that powers Claude Code, exposed as a library.

Here’s what our simple agent looks like using the SDK:

import { query } from "@anthropic-ai/claude-agent-sdk";

for await (const message of query({
  prompt: "Fix the bug in auth.py",
  options: {
    model: "claude-opus-4-5-20251101",
    allowedTools: ["Read", "Edit", "Bash", "Glob", "Grep"],
    maxTurns: 50
  }
})) {
  if (message.type === "assistant") {
    for (const block of message.message.content) {
      if ("text" in block) {
        console.log(block.text);
      } else if ("name" in block) {
        console.log(`Using tool: ${block.name}`);
      }
    }
  }
}

The SDK handles:

The agent loop (so you don’t have to)
All the built-in tools (Read, Write, Edit, Bash, Glob, Grep, etc.)
Permission management
Context tracking
Sub-agent coordination

What We’ve Learned

Starting from a simple bash script, we discovered:

The agent loop: AI decides → execute → observe → repeat
Structured tools: better than raw bash for safety and precision
Surgical edits: str_replace beats full file rewrites
Search tools: let the AI explore codebases
Context management: compaction and delegation handle long tasks
Project knowledge: CLAUDE.md gives project-specific context

Each of these emerged from a practical problem:

“How do I make the AI do more than one thing?” → agent loop
“How do I prevent it from destroying my system?” → permission system
“How do I make edits efficient?” → str_replace tool
“How does it find code it doesn’t know about?” → search tools
“What happens when context runs out?” → compaction
“How does it know my project’s conventions?” → CLAUDE.md

This is how you could have invented Claude Code. The core ideas are simple.

The complexity comes from handling edge cases, building a good UX, and integrating with real development workflows.

Next Steps

If you want to build your own agents:

Start simple: a basic agent loop with 2-3 tools
Add tools incrementally: each new capability should solve a real problem
Handle errors gracefully: tools fail; your agent should recover
Test on real tasks: the edge cases will teach you what’s missing
Consider using the Claude Agent SDK: why reinvent the wheel?
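
On the "handle errors gracefully" point, one pattern that works well is to never let a tool exception escape the loop. Catch it and hand the error back to the model as a tool result instead, so it can try something else. Here is a minimal sketch. `safe_tool_result` is a hypothetical helper, `toolu_demo` is a made-up ID, and `is_error` is the flag the Messages API accepts on `tool_result` blocks.

```python
def safe_tool_result(tool_use_id, tool_fn, tool_input):
    """Run a tool and always return a tool_result block, even on failure."""
    try:
        content = str(tool_fn(**tool_input))
        is_error = False
    except Exception as exc:  # report the failure instead of crashing the loop
        content = f"{type(exc).__name__}: {exc}"
        is_error = True
    return {
        "type": "tool_result",
        "tool_use_id": tool_use_id,
        "content": content,
        "is_error": is_error,
    }

def read_file(path):
    with open(path, "r") as f:
        return f.read()

failed = safe_tool_result("toolu_demo", read_file, {"path": "/no/such/file.txt"})
print(failed["is_error"], failed["content"])
```

This also covers the case where the model passes the wrong parameter names: the resulting `TypeError` comes back as a readable message instead of a stack trace in your terminal.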

The future of software development is agents that can actually do things. Now we know how they work!

Resources:

The Complete Guide to Building Agents with the Claude Agent SDK

The fast path to get up and running your own custom AI Agents.

Nader Dabit

Jan 08, 2026

This is cross-posted from this post on X. You can also view it in markdown in its entirety here.

If you’ve used Claude Code, you’ve seen what an AI agent can actually do: read files, run commands, edit code, figure out the steps to accomplish a task.

And you know it doesn’t just help you write code; it takes ownership of problems and works through them the way a thoughtful engineer would.

The Claude Agent SDK is the same engine, yours to point at whatever problem you want, so you can easily build agents of your own.

It’s the infrastructure behind Claude Code, exposed as a library. You get the agent loop, the built-in tools, the context management, basically everything you’d otherwise have to build yourself.

This guide walks through building a code review agent from scratch. By the end, you’ll have something that can analyze a codebase, find bugs and security issues, and return structured feedback.

More importantly, you’ll understand how the SDK works so you can build whatever you actually need.

What we’re building

Our code review agent will:

Analyze a codebase for bugs and security issues
Read files and search through code autonomously
Provide structured, actionable feedback
Track its progress as it works

The stack

• Runtime - Claude Code CLI
• SDK - @anthropic-ai/claude-agent-sdk
• Language - TypeScript
• Model - Claude Opus 4.5

What the SDK gives you

If you’ve built agents with the raw API, you know the pattern: call the model, check if it wants to use a tool, execute the tool, feed the result back, repeat until done. This can get tedious when building anything non-trivial.

The SDK handles that loop:

// Without the SDK: You manage the loop
let response = await client.messages.create({...});
while (response.stop_reason === "tool_use") {
  const result = yourToolExecutor(response.tool_use);
  response = await client.messages.create({ tool_result: result, ... });
}

// With the SDK: Claude manages it
for await (const message of query({ prompt: "Fix the bug in auth.py" })) {
  console.log(message); // Claude reads files, finds bugs, edits code
}
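The manual loop above is schematic. As a self-contained sketch of the same pattern, the version below swaps the real API for a scripted `fakeModel` so it runs offline. Every name here (`fakeModel`, `runTool`, `runAgent`, the message shapes) is hypothetical and much simpler than the real API types.

```typescript
// Simplified, hypothetical shapes; the real API has richer message types.
type ModelTurn =
  | { stopReason: "tool_use"; toolName: string; toolInput: string }
  | { stopReason: "end_turn"; text: string };

type HistoryEntry = { role: "user" | "assistant" | "tool"; content: string };

// Stand-in for an LLM call: asks for one tool call, then answers
// using the tool result it finds in the history.
function fakeModel(history: HistoryEntry[]): ModelTurn {
  const toolResult = history.find((entry) => entry.role === "tool");
  if (!toolResult) {
    return { stopReason: "tool_use", toolName: "list_files", toolInput: "." };
  }
  return { stopReason: "end_turn", text: `Found: ${toolResult.content}` };
}

// Stand-in tool executor with a single canned tool.
function runTool(name: string, input: string): string {
  if (name === "list_files") return `${input}/auth.py, ${input}/test_auth.py`;
  return `Unknown tool: ${name}`;
}

// The loop itself: call the model, run any requested tool,
// append the result to the history, and repeat until the model stops.
function runAgent(
  prompt: string,
  maxTurns = 10
): { answer: string; history: HistoryEntry[] } {
  const history: HistoryEntry[] = [{ role: "user", content: prompt }];
  for (let turn = 0; turn < maxTurns; turn++) {
    const response = fakeModel(history);
    if (response.stopReason === "end_turn") {
      history.push({ role: "assistant", content: response.text });
      return { answer: response.text, history };
    }
    history.push({
      role: "assistant",
      content: `call ${response.toolName}(${response.toolInput})`,
    });
    history.push({
      role: "tool",
      content: runTool(response.toolName, response.toolInput),
    });
  }
  return { answer: "Stopped: turn limit reached", history };
}
```

The turn limit plays the same role as `maxTurns` in the SDK examples later on: it keeps a confused model from looping forever.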

You also get working tools out of the box:

• Read - read any file in the working directory
• Write - create new files
• Edit - make precise edits to existing files
• Bash - run terminal commands
• Glob - find files by pattern
• Grep - search file contents with regex
• WebSearch - search the web
• WebFetch - fetch and parse web pages

You don’t have to implement any of this yourself.

Prerequisites

Node.js 18+ installed
An Anthropic API key (get one here)

Getting started

Step 1: Install Claude Code CLI

The Agent SDK uses Claude Code as its runtime:

npm install -g @anthropic-ai/claude-code

After installing, run claude in your terminal and follow the prompts to authenticate.

Step 2: Create your project

mkdir code-review-agent && cd code-review-agent
npm init -y
npm install @anthropic-ai/claude-agent-sdk
npm install -D typescript @types/node tsx

Step 3: Set your API key

export ANTHROPIC_API_KEY=your-api-key

Your first agent

Create agent.ts:

import { query } from "@anthropic-ai/claude-agent-sdk";

async function main() {
  for await (const message of query({
    prompt: "What files are in this directory?",
    options: {
      model: "opus",
      allowedTools: ["Glob", "Read"],
      maxTurns: 250
    }
  })) {
    if (message.type === "assistant") {
      for (const block of message.message.content) {
        if ("text" in block) {
          console.log(block.text);
        }
      }
    }
    
    if (message.type === "result") {
      console.log("\nDone:", message.subtype);
    }
  }
}

main();

Run it:

npx tsx agent.ts

Claude will use the Glob tool to list files and tell you what it found.

Understanding the message stream

The query() function returns an async generator that streams messages as Claude works. Here are the key message types:

for await (const message of query({ prompt: "..." })) {
  switch (message.type) {
    case "system":
      // Session initialization info
      if (message.subtype === "init") {
        console.log("Session ID:", message.session_id);
        console.log("Available tools:", message.tools);
      }
      break;
      
    case "assistant":
      // Claude's responses and tool calls
      for (const block of message.message.content) {
        if ("text" in block) {
          console.log("Claude:", block.text);
        } else if ("name" in block) {
          console.log("Tool call:", block.name);
        }
      }
      break;
      
    case "result":
      // Final result
      console.log("Status:", message.subtype); // "success" or error type
      console.log("Cost:", message.total_cost_usd);
      break;
  }
}
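If you would rather collect the stream into a single object than log as you go, a small reducer works. The sketch below uses simplified local types that mirror only the fields used above (an assumption, not the SDK's exported types) and a scripted `fakeStream` in place of `query()` so it runs without an API key.

```typescript
// Simplified stand-ins for the message shapes used above.
type ContentBlock =
  | { type: "text"; text: string }
  | { type: "tool_use"; name: string };

type StreamMessage =
  | { type: "system"; subtype: "init"; session_id: string }
  | { type: "assistant"; message: { content: ContentBlock[] } }
  | { type: "result"; subtype: string; total_cost_usd: number };

interface RunSummary {
  sessionId?: string;
  text: string[];
  toolCalls: string[];
  status?: string;
  costUsd?: number;
}

// Fold a stream of messages into one summary object.
async function summarize(stream: AsyncIterable<StreamMessage>): Promise<RunSummary> {
  const summary: RunSummary = { text: [], toolCalls: [] };
  for await (const message of stream) {
    switch (message.type) {
      case "system":
        summary.sessionId = message.session_id;
        break;
      case "assistant":
        for (const block of message.message.content) {
          if (block.type === "text") summary.text.push(block.text);
          else summary.toolCalls.push(block.name);
        }
        break;
      case "result":
        summary.status = message.subtype;
        summary.costUsd = message.total_cost_usd;
        break;
    }
  }
  return summary;
}

// A scripted stream standing in for query(), so the sketch runs offline.
async function* fakeStream(): AsyncGenerator<StreamMessage> {
  yield { type: "system", subtype: "init", session_id: "demo-session" };
  yield { type: "assistant", message: { content: [{ type: "tool_use", name: "Glob" }] } };
  yield { type: "assistant", message: { content: [{ type: "text", text: "Found 2 files." }] } };
  yield { type: "result", subtype: "success", total_cost_usd: 0.01 };
}
```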

Building a code review agent

Now let’s build something useful. Create review-agent.ts:

import { query } from "@anthropic-ai/claude-agent-sdk";

async function reviewCode(directory: string) {
  console.log(`\n🔍 Starting code review for: ${directory}\n`);
  
  for await (const message of query({
    prompt: `Review the code in ${directory} for:
1. Bugs and potential crashes
2. Security vulnerabilities  
3. Performance issues
4. Code quality improvements

Be specific about file names and line numbers.`,
    options: {
      model: "opus",
      allowedTools: ["Read", "Glob", "Grep"],
      permissionMode: "bypassPermissions", // Skip permission prompts (use with caution)
      maxTurns: 250
    }
  })) {
    // Show Claude's analysis as it happens
    if (message.type === "assistant") {
      for (const block of message.message.content) {
        if ("text" in block) {
          console.log(block.text);
        } else if ("name" in block) {
          console.log(`\n📁 Using ${block.name}...`);
        }
      }
    }
    
    // Show completion status
    if (message.type === "result") {
      if (message.subtype === "success") {
        console.log(`\n✅ Review complete! Cost: $${message.total_cost_usd.toFixed(4)}`);
      } else {
        console.log(`\n❌ Review failed: ${message.subtype}`);
      }
    }
  }
}

// Review the current directory
reviewCode(".");

Testing It Out

Create a file with some intentional issues. Create example.ts:

function processUsers(users: any) {
  for (let i = 0; i <= users.length; i++) { // Off-by-one error
    console.log(users[i].name.toUpperCase()); // No null check
  }
}

function connectToDb(password: string) {
  const connectionString = `postgres://admin:${password}@localhost/db`;
  console.log("Connecting with:", connectionString); // Logging sensitive data
}

async function fetchData(url) { // Missing type annotation
  const response = await fetch(url);
  return response.json(); // No error handling
}

Run the review:

npx tsx review-agent.ts

Claude will identify the bugs, security issues, and suggest fixes.
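For reference, here is one way the first two functions might look after applying the kind of fixes a review would suggest. This is my own sketch, not output from the agent. The `User` type, the `describeConnection` helper, and the choice to return names rather than log them are assumptions made so the result is easy to check.

```typescript
// Hypothetical shape for the records passed in.
interface User {
  name?: string | null;
}

// Returns upper-cased names, skipping entries without a usable name.
function processUsers(users: ReadonlyArray<User | null | undefined>): string[] {
  const names: string[] = [];
  // `<` instead of `<=` fixes the off-by-one read past the end of the array.
  for (let i = 0; i < users.length; i++) {
    const name = users[i]?.name;
    // Guard against missing users and missing names before calling a method.
    if (typeof name === "string") {
      names.push(name.toUpperCase());
    }
  }
  return names;
}

// Build a log-safe description of the connection with the password masked.
function describeConnection(host: string, database: string, user: string): string {
  return `postgres://${user}:***@${host}/${database}`;
}
```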

Adding Structured Output

For programmatic use, you’ll want structured data. The SDK supports JSON Schema output:

import { query } from "@anthropic-ai/claude-agent-sdk";

const reviewSchema = {
  type: "object",
  properties: {
    issues: {
      type: "array",
      items: {
        type: "object",
        properties: {
          severity: { type: "string", enum: ["low", "medium", "high", "critical"] },
          category: { type: "string", enum: ["bug", "security", "performance", "style"] },
          file: { type: "string" },
          line: { type: "number" },
          description: { type: "string" },
          suggestion: { type: "string" }
        },
        required: ["severity", "category", "file", "description"]
      }
    },
    summary: { type: "string" },
    overallScore: { type: "number" }
  },
  required: ["issues", "summary", "overallScore"]
};

async function reviewCodeStructured(directory: string) {
  for await (const message of query({
    prompt: `Review the code in ${directory}. Identify all issues.`,
    options: {
      model: "opus",
      allowedTools: ["Read", "Glob", "Grep"],
      permissionMode: "bypassPermissions",
      maxTurns: 250,
      outputFormat: {
        type: "json_schema",
        schema: reviewSchema
      }
    }
  })) {
    if (message.type === "result" && message.subtype === "success") {
      const review = message.structured_output as {
        issues: Array<{
          severity: string;
          category: string;
          file: string;
          line?: number;
          description: string;
          suggestion?: string;
        }>;
        summary: string;
        overallScore: number;
      };
      
      console.log(`\n📊 Code Review Results\n`);
      console.log(`Score: ${review.overallScore}/100`);
      console.log(`Summary: ${review.summary}\n`);
      
      for (const issue of review.issues) {
        const icon = issue.severity === "critical" ? "🔴" :
                     issue.severity === "high" ? "🟠" :
                     issue.severity === "medium" ? "🟡" : "🟢";
        console.log(`${icon} [${issue.category.toUpperCase()}] ${issue.file}${issue.line ? `:${issue.line}` : ""}`);
        console.log(`   ${issue.description}`);
        if (issue.suggestion) {
          console.log(`   💡 ${issue.suggestion}`);
        }
        console.log();
      }
    }
  }
}

reviewCodeStructured(".");
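The `as` cast above trusts that the output matches the schema. If you want a runtime check before using the data, a hand-written type guard is one option. The sketch below is an assumption-labeled example, not an SDK feature: the `isReview` and `isIssue` helpers are hypothetical and only verify the fields the schema marks as required, plus the types of the optional ones.

```typescript
type Severity = "low" | "medium" | "high" | "critical";

interface Issue {
  severity: Severity;
  category: string;
  file: string;
  line?: number;
  description: string;
  suggestion?: string;
}

interface Review {
  issues: Issue[];
  summary: string;
  overallScore: number;
}

const SEVERITIES: readonly string[] = ["low", "medium", "high", "critical"];

function isRecord(value: unknown): value is Record<string, unknown> {
  return typeof value === "object" && value !== null && !Array.isArray(value);
}

// Checks the required fields of one issue and the types of optional ones.
function isIssue(value: unknown): value is Issue {
  return (
    isRecord(value) &&
    typeof value.severity === "string" &&
    SEVERITIES.includes(value.severity) &&
    typeof value.category === "string" &&
    typeof value.file === "string" &&
    typeof value.description === "string" &&
    (value.line === undefined || typeof value.line === "number") &&
    (value.suggestion === undefined || typeof value.suggestion === "string")
  );
}

// Checks the top-level review object, including every issue in the list.
function isReview(value: unknown): value is Review {
  return (
    isRecord(value) &&
    Array.isArray(value.issues) &&
    value.issues.every(isIssue) &&
    typeof value.summary === "string" &&
    typeof value.overallScore === "number"
  );
}
```

In the handler, `if (isReview(message.structured_output)) { ... }` would then narrow the type without a cast.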

Handling permissions

By default, the SDK asks for approval before executing tools. You can customize this:

Permission modes

options: {
  // Pick one mode:
  //   "default"           - standard mode, prompts for approval
  //   "acceptEdits"       - auto-approve file edits
  //   "bypassPermissions" - no prompts (use with caution)
  permissionMode: "default"
}

Custom permission handler

For fine-grained control, use canUseTool:

options: {
  canUseTool: async (toolName, input) => {
    // Allow all read operations
    if (["Read", "Glob", "Grep"].includes(toolName)) {
      return { behavior: "allow", updatedInput: input };
    }
    
    // Block writes to certain files
    if (toolName === "Write" && typeof input.file_path === "string" && input.file_path.includes(".env")) {
      return { behavior: "deny", message: "Cannot modify .env files" };
    }
    
    // Allow everything else
    return { behavior: "allow", updatedInput: input };
  }
}
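Because the handler's logic is just a function of the tool name and input, it can be pulled out and tested on its own. The sketch below is one possible policy, with a local `Decision` type standing in for the return shape used above. The `decide` and `isEnvFile` names are hypothetical, and treating `Edit` as taking a `file_path` input is an assumption.

```typescript
// Local stand-in for the decision shape used in the handler above.
type Decision =
  | { behavior: "allow"; updatedInput: Record<string, unknown> }
  | { behavior: "deny"; message: string };

const READ_ONLY_TOOLS = new Set(["Read", "Glob", "Grep"]);
const FILE_WRITING_TOOLS = new Set(["Write", "Edit"]);

// True when the final path segment is ".env" or starts with ".env."
// (for example ".env.local").
function isEnvFile(filePath: string): boolean {
  const base = filePath.split(/[\\/]/).pop() ?? "";
  return base === ".env" || base.startsWith(".env.");
}

// Pure policy function: same rules as the handler, but easy to unit test.
function decide(toolName: string, input: Record<string, unknown>): Decision {
  if (READ_ONLY_TOOLS.has(toolName)) {
    return { behavior: "allow", updatedInput: input };
  }
  const filePath = typeof input.file_path === "string" ? input.file_path : "";
  if (FILE_WRITING_TOOLS.has(toolName) && isEnvFile(filePath)) {
    return { behavior: "deny", message: "Cannot modify .env files" };
  }
  return { behavior: "allow", updatedInput: input };
}
```

Matching on the file name rather than a substring avoids blocking unrelated paths such as `src/environment.ts`, at the cost of missing env files with unusual names.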

Creating subagents

For complex tasks, you can create specialized subagents:

import { query, AgentDefinition } from "@anthropic-ai/claude-agent-sdk";

async function comprehensiveReview(directory: string) {
  for await (const message of query({
    prompt: `Perform a comprehensive code review of ${directory}. 
Use the security-reviewer for security issues and test-analyzer for test coverage.`,
    options: {
      model: "opus",
      allowedTools: ["Read", "Glob", "Grep", "Task"], // Task enables subagents
      permissionMode: "bypassPermissions",
      maxTurns: 250,
      agents: {
        "security-reviewer": {
          description: "Security specialist for vulnerability detection",
          prompt: `You are a security expert. Focus on:
- SQL injection, XSS, CSRF vulnerabilities
- Exposed credentials and secrets
- Insecure data handling
- Authentication/authorization issues`,
          tools: ["Read", "Grep", "Glob"],
          model: "sonnet"
        } as AgentDefinition,
        
        "test-analyzer": {
          description: "Test coverage and quality analyzer",
          prompt: `You are a testing expert. Analyze:
- Test coverage gaps
- Missing edge cases
- Test quality and reliability
- Suggestions for additional tests`,
          tools: ["Read", "Grep", "Glob"],
          model: "haiku" // Use faster model for simpler analysis
        } as AgentDefinition
      }
    }
  })) {
    if (message.type === "assistant") {
      for (const block of message.message.content) {
        if ("text" in block) {
          console.log(block.text);
        } else if ("name" in block && block.name === "Task") {
          console.log(`\n🤖 Delegating to: ${(block.input as any).subagent_type}`);
        }
      }
    }
  }
}

comprehensiveReview(".");

Session management

For multi-turn conversations, capture and resume sessions:

import { query } from "@anthropic-ai/claude-agent-sdk";

async function interactiveReview() {
  let sessionId: string | undefined;
  
  // Initial review
  for await (const message of query({
    prompt: "Review this codebase and identify the top 3 issues",
    options: {
      model: "opus",
      allowedTools: ["Read", "Glob", "Grep"],
      permissionMode: "bypassPermissions",
      maxTurns: 250
    }
  })) {
    if (message.type === "system" && message.subtype === "init") {
      sessionId = message.session_id;
    }
    // ... handle messages
  }
  
  // Follow-up question using same session
  if (sessionId) {
    for await (const message of query({
      prompt: "Now show me how to fix the most critical issue",
      options: {
        resume: sessionId, // Continue the conversation
        allowedTools: ["Read", "Glob", "Grep"],
        maxTurns: 250
      }
    })) {
      // Claude remembers the previous context
    }
  }
}

Using hooks

Hooks let you intercept and customize agent behavior:

import { query, HookCallback, PreToolUseHookInput } from "@anthropic-ai/claude-agent-sdk";

// Hook callbacks receive three arguments:
// 1. input - details about the event (tool name, arguments, etc.)
// 2. toolUseId - correlates PreToolUse and PostToolUse events for the same call
// 3. context - contains AbortSignal for cancellation
const auditLogger: HookCallback = async (input, toolUseId, { signal }) => {
  if (input.hook_event_name === "PreToolUse") {
    const preInput = input as PreToolUseHookInput;
    console.log(`[AUDIT] ${new Date().toISOString()} - ${preInput.tool_name}`);
  }
  return {}; // Return empty object to allow the operation
};

const blockDangerousCommands: HookCallback = async (input, toolUseId, { signal }) => {
  if (input.hook_event_name === "PreToolUse") {
    const preInput = input as PreToolUseHookInput;
    if (preInput.tool_name === "Bash") {
      const command = (preInput.tool_input as any).command || "";
      if (command.includes("rm -rf") || command.includes("sudo")) {
        return {
          hookSpecificOutput: {
            hookEventName: "PreToolUse",
            permissionDecision: "deny",  // Block the tool from executing
            permissionDecisionReason: "Dangerous command blocked"
          }
        };
      }
    }
  }
  return {};
};

for await (const message of query({
  prompt: "Clean up temporary files",
  options: {
    model: "opus",
    allowedTools: ["Bash", "Glob"],
    maxTurns: 50,
    hooks: {
      // PreToolUse fires before each tool executes
      // Other hooks: PostToolUse, Stop, SessionStart, SessionEnd, etc.
      PreToolUse: [
        // Each entry has an optional matcher (regex) and an array of callbacks
        // No matcher = runs for ALL tool calls
        { hooks: [auditLogger] },
        
        // matcher: 'Bash' = only runs when tool_name matches 'Bash'
        // Use regex for multiple tools: 'Bash|Write|Edit'
        { matcher: "Bash", hooks: [blockDangerousCommands] }
      ]
    }
  }
})) {
  if (message.type === "assistant") {
    for (const block of message.message.content) {
      if ("text" in block) {
        console.log(block.text);
      }
    }
  }
}
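The substring checks in `blockDangerousCommands` are easy to read but also easy to trip: `rm -r -f build` slips through, while `echo "rm -rf"` is blocked. A slightly more careful heuristic looks at each sub-command's program and flags. The `findDangerReason` helper below is a hypothetical sketch; it does not handle quoting, subshells, or aliases, so treat it as a convenience check and not a security boundary.

```typescript
// Returns a reason string if the command looks dangerous, otherwise null.
function findDangerReason(command: string): string | null {
  // Split on shell separators so each sub-command is checked on its own.
  const segments = command.split(/&&|\|\||[;|]/);
  for (const segment of segments) {
    const tokens = segment.trim().split(/\s+/).filter(Boolean);
    if (tokens.length === 0) continue;
    const [program, ...args] = tokens;

    if (program === "sudo") return "sudo is not allowed";

    if (program === "rm") {
      // Collect short flags such as -r, -f, or -rf into one string.
      const shortFlags = args
        .filter((arg) => arg.startsWith("-") && !arg.startsWith("--"))
        .join("");
      const recursive =
        shortFlags.includes("r") || shortFlags.includes("R") || args.includes("--recursive");
      const force = shortFlags.includes("f") || args.includes("--force");
      if (recursive && force) return "recursive forced delete is not allowed";
    }
  }
  return null;
}
```

Inside the hook, a non-null reason could be passed along as the `permissionDecisionReason`.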

Custom tool calling

Tools are how agents interact with the world - reading files, calling APIs, querying databases, running code. The SDK includes built-in tools for common operations (filesystem, shell, web), but most agents will need custom tools to access your own systems.

The raw API pattern

Without the SDK, you manage the tool loop yourself:

// 1. Define tools with their schemas
const tools = [{
  name: "get_weather",
  description: "Get current weather for a city",
  input_schema: {
    type: "object",
    properties: {
      city: { type: "string", description: "City name" }
    },
    required: ["city"]
  }
}];

// 2. Write an executor for each tool
// fetchWeatherAPI is a placeholder for your own function that returns a string
function executeTool(name: string, input: any): string {
  if (name === "get_weather") {
    return fetchWeatherAPI(input.city);
  }
  throw new Error(`Unknown tool: ${name}`);
}

// 3. Run the agent loop
// client is an Anthropic client instance, e.g. new Anthropic() from "@anthropic-ai/sdk"
const model = "claude-opus-4-5-20251101";
const messages = [{ role: "user", content: "What's the weather in Tokyo?" }];

let response = await client.messages.create({
  model,
  max_tokens: 1024, // required by the Messages API
  tools,
  messages
});

while (response.stop_reason === "tool_use") {
  messages.push({ role: "assistant", content: response.content });
  
  const toolResults = response.content
    .filter(block => block.type === "tool_use")
    .map(toolUse => ({
      type: "tool_result",
      tool_use_id: toolUse.id,
      content: executeTool(toolUse.name, toolUse.input)
    }));
  
  messages.push({ role: "user", content: toolResults });
  response = await client.messages.create({ model, max_tokens: 1024, tools, messages });
}

const textBlock = response.content.find(block => block.type === "text");
if (textBlock && textBlock.type === "text") {
  console.log("Final response:", textBlock.text);
}

Key points:

Claude decides when to use tools based on the user’s request and tool descriptions
You execute the tools and return results
The loop continues until Claude has enough information (stop_reason: "end_turn")
Message history grows with each iteration - the API is stateless, so every request needs the full conversation

What the SDK handles

When you use built-in tools like allowedTools: ["Read", "Glob"], the SDK manages all of this automatically - definitions, execution, and the loop.

For custom tools, you need a way to define them so the SDK can do the same. That’s what MCP provides.

Adding custom tools with MCP

Extend Claude with custom tools using Model Context Protocol:

import { query, tool, createSdkMcpServer } from "@anthropic-ai/claude-agent-sdk";
import { z } from "zod";

// Create your custom MCP server
const customServer = createSdkMcpServer({
  name: "code-metrics",
  version: "1.0.0",
  tools: [
    // Define a custom tool using the `tool` helper
    // Arguments: name, description, input schema, handler function
    tool(
      "analyze_complexity",
      "Calculate cyclomatic complexity for a file",
      {
        // Zod schema defines what inputs the tool accepts
        filePath: z.string().describe("Path to the file to analyze")
      },
      // Handler function - runs when Claude calls the tool
      async (args) => {
        // In a real implementation, calculate the actual complexity
        const complexity = Math.floor(Math.random() * 20) + 1;
        
        // Return format required by MCP - array of content blocks
        return {
          content: [{
            type: "text",
            text: `Cyclomatic complexity for ${args.filePath}: ${complexity}`
          }]
        };
      }
    )
  ]
});

async function analyzeCode(filePath: string) {
  for await (const message of query({
    prompt: `Analyze the complexity of ${filePath}`,
    options: {
      model: "opus",
      
      // Register the custom MCP server
      // The key ("code-metrics") becomes part of the tool name
      mcpServers: {
        "code-metrics": customServer
      },
      
      // Specify which tools Claude can use
      // MCP tools follow the pattern: mcp__<server-name>__<tool-name>
      allowedTools: ["Read", "mcp__code-metrics__analyze_complexity"],
      
      // Maximum number of back-and-forth turns before stopping
      maxTurns: 50
    }
  })) {
    // Handle assistant messages (Claude's responses and tool calls)
    if (message.type === "assistant") {
      for (const block of message.message.content) {
        // Text blocks contain Claude's written responses
        if ("text" in block) {
          console.log(block.text);
        }
      }
    }
    
    // Handle the final result when the agent loop completes
    if (message.type === "result") {
      console.log("Done:", message.subtype); // "success" or an error type
    }
  }
}

analyzeCode("main.ts");
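The handler above returns a random number as a placeholder. As a rough idea of what could go there, the sketch below estimates cyclomatic complexity by counting decision points with a regular expression. `estimateComplexity` is a hypothetical helper: it does not parse the code, so keywords inside strings or comments will skew the count, and a real tool would use a proper parser.

```typescript
// Rough heuristic: 1 + the number of decision points found by matching
// branch keywords, short-circuit operators, and ternary question marks.
function estimateComplexity(source: string): number {
  // The lookarounds on "?" skip optional chaining (?.), nullish
  // coalescing (??), and optional members in type annotations (?:).
  const decisionPoints =
    /\b(if|for|while|case|catch)\b|&&|\|\||(?<!\?)\?(?![.?:])/g;
  const matches = source.match(decisionPoints);
  return 1 + (matches ? matches.length : 0);
}

// Small sample with five decision points: if, &&, for, if, and a ternary.
const sampleSource = `
function f(a, b) {
  if (a && b) { return 1; }
  for (let i = 0; i < 3; i++) { if (i) continue; }
  return a ? 1 : 0;
}`;

console.log(estimateComplexity(sampleSource)); // prints 6
```

To wire this into the tool, the handler could read the file at `args.filePath` and pass its contents to the function.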

Cost tracking

Track API costs for billing:

for await (const message of query({ prompt: "..." })) {
  if (message.type === "result" && message.subtype === "success") {
    console.log("Total cost:", message.total_cost_usd);
    console.log("Token usage:", message.usage);
    
    // Per-model breakdown (useful with subagents)
    for (const [model, usage] of Object.entries(message.modelUsage)) {
      console.log(`${model}: $${usage.costUSD.toFixed(4)}`);
    }
  }
}
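For billing across many runs, you will probably want to accumulate these numbers. Below is a minimal sketch of a running ledger. The `CostLedger` class and the model keys in the usage note are hypothetical, and `ModelUsage` is a simplified stand-in that keeps only the `costUSD` field used above.

```typescript
// Simplified stand-in for the per-model usage map shown above.
type ModelUsage = Record<string, { costUSD: number }>;

class CostLedger {
  private totals: Record<string, number> = {};

  // Add one run's per-model costs to the running totals.
  record(modelUsage: ModelUsage): void {
    for (const [model, usage] of Object.entries(modelUsage)) {
      this.totals[model] = (this.totals[model] ?? 0) + usage.costUSD;
    }
  }

  // Total spend across all models and runs recorded so far.
  total(): number {
    return Object.values(this.totals).reduce((sum, cost) => sum + cost, 0);
  }

  // Copy of the per-model totals, safe for callers to mutate.
  byModel(): Record<string, number> {
    return { ...this.totals };
  }

  isOverBudget(budgetUsd: number): boolean {
    return this.total() > budgetUsd;
  }
}
```

A caller would invoke `ledger.record(message.modelUsage)` inside the `result` branch and check `ledger.isOverBudget(...)` before starting the next run.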

Production code review agent

Here’s a production-ready agent that ties everything together:

import { query, AgentDefinition } from "@anthropic-ai/claude-agent-sdk";

interface ReviewResult {
  issues: Array<{
    severity: "low" | "medium" | "high" | "critical";
    category: "bug" | "security" | "performance" | "style";
    file: string;
    line?: number;
    description: string;
    suggestion?: string;
  }>;
  summary: string;
  overallScore: number;
}

const reviewSchema = {
  type: "object",
  properties: {
    issues: {
      type: "array",
      items: {
        type: "object",
        properties: {
          severity: { type: "string", enum: ["low", "medium", "high", "critical"] },
          category: { type: "string", enum: ["bug", "security", "performance", "style"] },
          file: { type: "string" },
          line: { type: "number" },
          description: { type: "string" },
          suggestion: { type: "string" }
        },
        required: ["severity", "category", "file", "description"]
      }
    },
    summary: { type: "string" },
    overallScore: { type: "number" }
  },
  required: ["issues", "summary", "overallScore"]
};

async function runCodeReview(directory: string): Promise<ReviewResult | null> {
  console.log(`\n${"=".repeat(50)}`);
  console.log(`🔍 Code Review Agent`);
  console.log(`📁 Directory: ${directory}`);
  console.log(`${"=".repeat(50)}\n`);

  let result: ReviewResult | null = null;

  for await (const message of query({
    prompt: `Perform a thorough code review of ${directory}.

Analyze all source files for:
1. Bugs and potential runtime errors
2. Security vulnerabilities
3. Performance issues
4. Code quality and maintainability

Be specific with file paths and line numbers where possible.`,
    options: {
      model: "opus",
      allowedTools: ["Read", "Glob", "Grep", "Task"],
      permissionMode: "bypassPermissions",
      maxTurns: 250,
      outputFormat: {
        type: "json_schema",
        schema: reviewSchema
      },
      agents: {
        "security-scanner": {
          description: "Deep security analysis for vulnerabilities",
          prompt: `You are a security expert. Scan for:
- Injection vulnerabilities (SQL, XSS, command injection)
- Authentication and authorization flaws
- Sensitive data exposure
- Insecure dependencies`,
          tools: ["Read", "Grep", "Glob"],
          model: "sonnet"
        } as AgentDefinition
      }
    }
  })) {
    // Progress updates
    if (message.type === "assistant") {
      for (const block of message.message.content) {
        if ("name" in block) {
          if (block.name === "Task") {
            console.log(`🤖 Delegating to: ${(block.input as any).subagent_type}`);
          } else {
            console.log(`📂 ${block.name}: ${getToolSummary(block)}`);
          }
        }
      }
    }

    // Final result
    if (message.type === "result") {
      if (message.subtype === "success" && message.structured_output) {
        result = message.structured_output as ReviewResult;
        console.log(`\n✅ Review complete! Cost: $${message.total_cost_usd.toFixed(4)}`);
      } else {
        console.log(`\n❌ Review failed: ${message.subtype}`);
      }
    }
  }

  return result;
}

function getToolSummary(block: any): string {
  const input = block.input || {};
  switch (block.name) {
    case "Read": return input.file_path || "file";
    case "Glob": return input.pattern || "pattern";
    case "Grep": return `"${input.pattern}" in ${input.path || "."}`;
    default: return "";
  }
}

function printResults(result: ReviewResult) {
  console.log(`\n${"=".repeat(50)}`);
  console.log(`📊 REVIEW RESULTS`);
  console.log(`${"=".repeat(50)}\n`);
  
  console.log(`Score: ${result.overallScore}/100`);
  console.log(`Issues Found: ${result.issues.length}\n`);
  console.log(`Summary: ${result.summary}\n`);
  
  const bySeverity = {
    critical: result.issues.filter(i => i.severity === "critical"),
    high: result.issues.filter(i => i.severity === "high"),
    medium: result.issues.filter(i => i.severity === "medium"),
    low: result.issues.filter(i => i.severity === "low")
  };
  
  for (const [severity, issues] of Object.entries(bySeverity)) {
    if (issues.length === 0) continue;
    
    const icon = severity === "critical" ? "🔴" :
                 severity === "high" ? "🟠" :
                 severity === "medium" ? "🟡" : "🟢";
    
    console.log(`\n${icon} ${severity.toUpperCase()} (${issues.length})`);
    console.log("-".repeat(30));
    
    for (const issue of issues) {
      const location = issue.line ? `${issue.file}:${issue.line}` : issue.file;
      console.log(`\n[${issue.category}] ${location}`);
      console.log(`  ${issue.description}`);
      if (issue.suggestion) {
        console.log(`  💡 ${issue.suggestion}`);
      }
    }
  }
}

// Run the review
async function main() {
  const directory = process.argv[2] || ".";
  const result = await runCodeReview(directory);
  
  if (result) {
    printResults(result);
  }
}

main().catch(console.error);

Run it:

npx tsx review-agent.ts ./src

What’s next

The code review agent covers the essentials: query(), allowedTools, structured output, subagents, and permissions.

If you want to go deeper:

More capabilities

File checkpointing - track and revert file changes
Skills - package reusable capabilities

Production deployment

Hosting - deploy in containers and CI/CD
Secure deployment - sandboxing and credential management

Full reference

This guide covers V1 of the SDK.
V2 is currently in development. I will update this guide with V2 once it’s released and stable.

If you’re interested in building verifiable agents, check out the work we’re doing at EigenCloud here

EigenLayer for Developers

A vibrant new ecosystem for builders.

Nader Dabit

Mar 05, 2024

To get started building on EigenLayer, check out the documentation here.

While introductions to EigenLayer are often focused on restaking, what got me interested was learning that restaking is just a means to an end: it enables something exciting and powerful that I’m still wrapping my head around.

EigenLayer enables developers to build arbitrary distributed systems on top of the Ethereum trust network.

It helps developers launch chains, decentralized networks, and Proof of Stake (PoS) systems with 10x the speed, ease, security, and distribution that was possible before.

It brings the power of Ethereum outside the EVM to additional apps on the network, making it an order of magnitude easier to build these systems by extending the programmability of Ethereum.

What does EigenLayer solve

Developers who build decentralized infrastructure face the challenge of establishing their own economic security.

While developers can use protocols like Ethereum to provide economic security for smart contracts, infrastructures like bridges, sequencers, oracles, and specialized data availability layers require their own economic security.

EigenLayer solves this problem by enabling any service, regardless of its composition (e.g. EVM-compatibility), to tap into the pooled security of Ethereum's stakers.

To put it simply, EigenLayer makes it much easier and more economically viable to build chains, distributed systems, or any PoS network.

How blockchain networks have evolved

To understand this better, let’s take a quick look at how blockchains have evolved over the past 15 years.

Application-specific blockchains

The first blockchain networks, like Bitcoin and Litecoin, were application-specific.

If someone wanted to build a slightly different network, they had to build the entire protocol from the ground up.

Therefore, blockchain developers at this time were literally building or maintaining the blockchains themselves. There was no concept of a “blockchain application” in the sense of running an app on one of these networks.

Programmable blockchains

With Ethereum and the EVM, blockchains became programmable.

Developers could now easily write a smart contract and deploy it to an existing general purpose chain, inheriting the liquidity and security of the underlying network.

This made developing apps an order of magnitude easier, and lowered the barrier to entry for developers wanting to build.

Programmable blockchains ushered in the current state of innovation, decentralized apps, and web3. It was now simple to experiment with new ideas without dedicating a large amount of resources to doing so.

Ethereum’s rollup-centric roadmap & scalability

As more apps and users have come online, it’s become obvious that the initial architecture of Ethereum and the EVM just doesn’t scale.

When building software, there are two approaches to scalability: vertical and horizontal. Vertical scaling involves increasing compute resources (CPU, RAM, storage, etc.), while horizontal scaling involves adding more servers or instances to distribute load, e.g. adding more app servers behind a load balancer.

Ethereum and the EVM are taking both approaches, with the network itself incorporating a rollup-centric roadmap to accomplish horizontal scaling.

Rollups outsource execution and transaction processing off of the main Ethereum (or L1) network, while still inheriting Ethereum security guarantees.

However, networks and protocols not proven by or deployed to the EVM cannot inherit these security and trust guarantees. Examples include sidechains, oracles, bridges, data availability layers, decentralized databases, decentralized storage, and sequencers among other things.

Bootstrapping a network from scratch

There are major challenges in building these types of networks and services from scratch.

Potential stakers have to first be identified and educated about what you are building.

These stakers also must forego taking part in, and receiving rewards from, other existing, trusted, and well known opportunities. They must be convinced instead to invest a significant amount of money and opportunity cost to take part in your new network, usually by buying the new network's native token, which is generally volatile and hard to get.

Building in this way also results in an undesirable and low trust model for apps using your service. The cost of corruption of the app will now be the minimum cost needed to compromise its weakest infrastructural dependency.
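The weakest-link point can be stated as a one-line calculation: the app's cost of corruption is the minimum over its dependencies. The sketch below illustrates this with made-up numbers. The dependency names and dollar figures are hypothetical and chosen only to show the arithmetic.

```typescript
interface Dependency {
  name: string;
  costToCorruptUsd: number;
}

// The app is only as hard to corrupt as its cheapest-to-attack dependency.
function weakestLink(dependencies: Dependency[]): Dependency {
  if (dependencies.length === 0) {
    throw new Error("at least one dependency is required");
  }
  return dependencies.reduce((weakest, dep) =>
    dep.costToCorruptUsd < weakest.costToCorruptUsd ? dep : weakest
  );
}

// Hypothetical figures, in USD, purely for illustration.
const appDependencies: Dependency[] = [
  { name: "base chain", costToCorruptUsd: 10_000_000_000 },
  { name: "bridge", costToCorruptUsd: 50_000_000 },
  { name: "oracle", costToCorruptUsd: 5_000_000 },
];

console.log(weakestLink(appDependencies).name); // prints "oracle"
```

However secure the base chain is in this example, an attacker only needs to meet the oracle's much lower cost.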

Introducing EigenLayer

Just as Ethereum enabled smart contract developers to easily build applications that inherit the security and trust of Ethereum, EigenLayer enables developers to build any distributed system or decentralized network while still leveraging the same security and trust guarantees of Ethereum.

This is a paradigm shift in how decentralized protocols and Proof of Stake networks are built. EigenLayer is already ushering in a new era of permissionless innovation and I believe will unleash a new generation of apps.

At EigenLayer, we call this type of service an AVS (Actively Validated Service).

So what are the types of services that can be built? The design space is infinite, but here are some verticals that developers are already focusing on:

Specialization and experimentation

Just as smart contracts drastically lowered the barrier to entry and opportunity cost of new experimentation for blockchain developers, EigenLayer does the same for developers building any type of chain, distributed system, or PoS network.

This means developers will be more likely to try new things because, even if they fail, the cost has been low enough for it to make sense.

There is also a higher incentive for specialization, because the opportunity cost is much lower and the incentive mechanisms much more streamlined and efficient.

Just like in the “web2” world we see deep specialization in countless verticals like arbitrary compute, databases, and serverless platforms, we’ll begin to see more specialization for decentralized infrastructure as the cost continues to go down for developers building in this space.

Specialization and experimentation are accelerated when the amount of work needed to build something is reduced dramatically. When you pair that with a better monetary value proposition for builders, there is the potential for unlocking a wave of new innovation.

For instance, to build a PoS network from scratch there are a lot of requirements (thanks to @13yearoldvc for the diagrams):

With EigenLayer, much of this architecture is handled for you:

Dual Staking

One of the most powerful features of EigenLayer is dual staking.

Problem

One of the reasons bootstrapping a new PoS network is hard is that it involves issuing a new token, then persuading people to buy and stake the token.

These tokens are often volatile, more risky, and hard to access.

Because the value of the new native token is closely tied to the system’s overall security, early PoS networks also face the potential “death spiral”.

A decrease in token value weakens the chain's security, resulting in capital flight (a decrease in TVL), which in turn pushes the native token price down further.

Solution

With dual staking, two tokens can be used to secure the same PoS network.

Instead of being required to buy and hold a new, more volatile token, stakers can stake ETH, a token with lower volatility, deeper liquidity, and broader access. Alongside ETH, they can also stake the network’s native token.
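To make the death-spiral argument concrete, here is a toy calculation. The stake sizes, prices, and the `securityValue` and `securityLoss` helpers are made up for illustration and are not EigenLayer's actual formula. The sketch only shows how an ETH component can dampen the effect of a native-token price drop on the total value at stake.

```typescript
// Toy model only: all numbers below are illustrative assumptions.
interface StakePosition {
  amount: number   // number of tokens staked
  priceUsd: number // current token price in USD
}

// Total USD value securing the network across all staked tokens.
function securityValue(positions: StakePosition[]): number {
  return positions.reduce((sum, p) => sum + p.amount * p.priceUsd, 0)
}

// Fraction of total security lost when the native token loses `drop` of its price.
function securityLoss(native: StakePosition, others: StakePosition[], drop: number): number {
  const before = securityValue([native, ...others])
  const repriced = { ...native, priceUsd: native.priceUsd * (1 - drop) }
  const after = securityValue([repriced, ...others])
  return (before - after) / before
}

const nativeStake: StakePosition = { amount: 1_000_000, priceUsd: 1 } // $1M of the native token
const ethStake: StakePosition = { amount: 500, priceUsd: 2_000 }      // $1M of ETH

// Native token falls 50% in both scenarios.
const nativeOnlyLoss = securityLoss(nativeStake, [], 0.5)          // 0.5
const dualStakingLoss = securityLoss(nativeStake, [ethStake], 0.5) // 0.25
```

In this made-up scenario the same 50% price drop removes half of the security of a native-only network, but only a quarter when an equal value of ETH is staked alongside it.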

You can learn more about how dual staking works here.

EigenLayer Ecosystem and resources

Interesting AVSs and rollups building on EigenLayer

There are already many AVSs and chains being built on EigenLayer. Check out the EigenLayer ecosystem page to learn more.

Hello World

The easiest way to get started building an AVS is to start with Hello World.

Additional Resources

To learn more about EigenLayer, I’ve created a curated list of my own favorite learning resources here.

Building Apps with World ID

How to build a sybil-resistant app with World ID device authentication

Nader Dabit

Feb 16, 2024

You can test out the app we’ll be building in this tutorial here. The final codebase is here.

World ID enables you to prove you’re a unique and real human on the internet while keeping your identity private.

For developers, this translates to a means of sybil resistance, which is becoming both more important and more challenging in a world of accelerating AI and automation.

I’ve experimented with many sybil resistance solutions in the past, such as GitCoin Passport. The tradeoffs in UX and usability are challenging and there doesn’t yet seem to be a silver bullet, but World ID’s approach looks quite promising.

In the past, WorldCoin brought to mind The Orb, which was controversial to some people and therefore not something that could be used today, at scale, as a single means of identity verification for any app wanting to scale globally in a permissionless fashion.

This is no longer the case: without ever visiting an Orb, anyone with a mobile device can use the WorldID App to provide unique credentials that are orders of magnitude harder and more expensive to sybil attack than traditional methods like social authentication or anything on-chain.

Identity methods

There are three authentication methods available through World ID:

World ID Device

Allows apps to accept World IDs that haven't yet been verified with an Orb, but are held on trusted devices. This is useful in lower-stakes scenarios.

This is the method we’ll be using in this tutorial.

World ID Orb

Orbs are devices that verify humanness and uniqueness. The classic Orb signal allows users to prove they are holding an Orb-verified World ID on a trusted device. This is more secure than device-only verification.

World ID Orb+

To enable higher security, apps can require face authentication when using World ID, similar to unlocking your phone. This ensures only the issued person can use their ID for that action. Authentication is performed locally on the user’s device thanks to Personal Custody.

Use Cases

User Verification

Use World ID to verify your existing user base if you already have users, or offer a way for users to sign up in another fashion.

User Sign in

Use World ID as an authentication method, similar to how you might use Google, Facebook, or GitHub login.

User Actions

Incognito Actions verify arbitrary actions a user might take in your app, where each action is independent of other actions. Here are some examples:

Claiming a one-time discount for a purchase (each discount is independent of other discounts)
Allowing users to vote in a poll, but only one vote per user (each vote is independent, votes from the same person cannot be linked)
Sharing an invitation to join a platform, limited to five total invites
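One way to picture how limits like these can be enforced: each World ID proof carries a nullifier hash that stays the same for a given person and action, so a server can count verifications per nullifier without learning who the person is. The sketch below is an in-memory illustration only. `ActionLimiter` and `tryUse` are hypothetical names, not part of the World ID SDK, and a real app would persist the counts.

```typescript
// Hypothetical helper, not part of the World ID SDK.
// Tracks how many times each nullifier hash has been used for one action.
class ActionLimiter {
  private counts = new Map<string, number>()
  private maxPerPerson: number

  constructor(maxPerPerson: number) {
    this.maxPerPerson = maxPerPerson
  }

  // Returns true and records the use if this nullifier is still under the limit.
  tryUse(nullifierHash: string): boolean {
    const used = this.counts.get(nullifierHash) ?? 0
    if (used >= this.maxPerPerson) return false
    this.counts.set(nullifierHash, used + 1)
    return true
  }
}

// A poll that allows one vote per person. The hashes are placeholder strings.
const poll = new ActionLimiter(1)
const firstVote = poll.tryUse("0xabc")  // true
const secondVote = poll.tryUse("0xabc") // false: same person, same action
const otherVoter = poll.tryUse("0xdef") // true
```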

Similarly to how you might confirm or manage this type of authorization in a database, WorldID

What we’re building

We’ll be building a simple app that allows users to sign in to an app using their mobile device.

We’ll verify the user on the server using the WorldID API, and integrate it client-side using the World ID IDKit SDK for React.

The user will also be given the option to connect their wallet to fetch their social profile from Lens or Farcaster using Airstack’s API.

We’ll be going the extra mile and adding some nice styling using the ShadCN UI library which is easy to set up with a new Next.js project.

Let’s start building.

Prerequisites

To follow along, you’ll need the following credentials.

When creating your World ID App, choose “Production” and “Cloud”:

After you create your World ID App ID, we’ll need to create a World ID incognito action. Set the Action Identifier to auth and set the Maximum Verifications Per Person to Unlimited.

We’ll be using both the App ID as well as the action later when we configure our environment variables.

Project setup

To get started, create a new Next.js app using the Next.js CLI:

npx create-next-app worldid-app

✔ Would you like to use TypeScript? Yes
✔ Would you like to use ESLint? Yes
✔ Would you like to use Tailwind CSS? Yes
✔ Would you like to use `src/` directory? No
✔ Would you like to use App Router? Yes
✔ Would you like to customize the default import alias (@/*)? No

Next, change into the new directory and install the following dependencies using npm, yarn, bun, or your favorite package manager:

npm install @worldcoin/idkit @airstack/node connectkit wagmi viem@2.x @tanstack/react-query next-themes

Next, add the following to tsconfig.json to disable the noImplicitAny TypeScript rule for the purposes of this tutorial:

"compilerOptions": {
  ...   
  "noImplicitAny": false
}

Finally, initialize ShadCN UI:

npx shadcn-ui@latest init

✔ Which style would you like to use? › New York
✔ Which color would you like to use as base color? › Slate
✔ Would you like to use CSS variables for colors? yes

Now, install the following three components using the ShadCN UI CLI:

npx shadcn-ui@latest add button sonner separator

Environment variables

Create a file named .env.local and add the following values:

AIRSTACK_API_KEY="your-airstack-api-key"
NEXT_PUBLIC_WALLETCONNECT_PROJECT_ID="your-walletconnect-project-id"

# WorldCoin ID Action Name
NEXT_PUBLIC_WC_ACTION="auth"

# Worldcoin ID App Name
# Remove the "app_" prefix from the WorldCoin App ID before setting (the code adds it back)
NEXT_PUBLIC_WLD_APP_ID="your-worldcoin-app-id"
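A missing value here tends to surface later as a confusing runtime failure, so it can help to check the variables up front. This is an optional sketch: `missingEnv` is a hypothetical helper, and the key list simply mirrors the file above.

```typescript
// The variable names from .env.local above.
const REQUIRED_ENV = [
  "AIRSTACK_API_KEY",
  "NEXT_PUBLIC_WALLETCONNECT_PROJECT_ID",
  "NEXT_PUBLIC_WC_ACTION",
  "NEXT_PUBLIC_WLD_APP_ID",
]

// Returns the names of required variables that are unset or empty.
function missingEnv(env: Record<string, string | undefined>): string[] {
  return REQUIRED_ENV.filter((key) => !env[key])
}

// Example with a partial environment. On the server you would pass process.env.
const missing = missingEnv({ AIRSTACK_API_KEY: "abc", NEXT_PUBLIC_WC_ACTION: "auth" })
// → ["NEXT_PUBLIC_WALLETCONNECT_PROJECT_ID", "NEXT_PUBLIC_WLD_APP_ID"]
```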

API Routes

Next, we’ll create our API routes. These will be the back end API endpoints that we’ll be using to interact with the World ID API and the Airstack API.

World ID verification

The first route we’ll create is for verifying a user using the World ID API.

Create a folder named api in the app directory, and inside it a folder named verify. In the verify directory, create a file named route.ts and add the following code:

//  app/api/verify/route.ts
import { NextResponse, NextRequest } from "next/server"

const endpoint = `https://developer.worldcoin.org/api/v1/verify/app_${process.env.NEXT_PUBLIC_WLD_APP_ID}`

export async function POST(req:NextRequest) {
  const body = await req.json()
  const reqBody = {
    merkle_root: body.merkle_root,
    nullifier_hash: body.nullifier_hash,
    proof: body.proof,
    verification_level: body.verification_level,
    signal: body.signal ?? "",
    action: process.env.NEXT_PUBLIC_WC_ACTION
  }

  const verifyRes = await fetch(endpoint, {
    method: "POST",
    headers: {
      "Content-Type": "application/json",
    },
    body: JSON.stringify(reqBody)
  })
  const wldResponse = await verifyRes.json()
  if (verifyRes.status == 200) {
    return NextResponse.json({ code: "success", wldResponse });
  } else {
    return NextResponse.json({ code: "failure", wldResponse });
  }
}

This route verifies the API call coming from the client, which includes the proof, nullifier hash, and merkle root (see the definition of types here).

The action is the action that was configured earlier, which allows this verification to be executed an unlimited number of times.

The signal is a string to be included in the zero-knowledge proof. This is typically used for on-chain actions, and we are optionally passing in a wallet address which could be verified on the server along with a signature for anything you’d like to do on-chain.
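The route above forwards whatever the client sends. A small guard can reject malformed bodies before they reach the World ID API. This is a sketch under assumptions: the `VerifyBody` type and the `parseVerifyBody` function are hypothetical names, and the fields are the ones the route already reads.

```typescript
// Shape of the JSON body the verify route reads (hypothetical type name).
interface VerifyBody {
  merkle_root: string
  nullifier_hash: string
  proof: string
  verification_level: string
  signal: string
}

const REQUIRED_FIELDS = ["merkle_root", "nullifier_hash", "proof", "verification_level"] as const

// Returns a typed body, or null if a required field is missing, empty, or not a string.
function parseVerifyBody(body: unknown): VerifyBody | null {
  if (typeof body !== "object" || body === null) return null
  const record = body as Record<string, unknown>
  for (const field of REQUIRED_FIELDS) {
    if (typeof record[field] !== "string" || record[field] === "") return null
  }
  return {
    merkle_root: record.merkle_root as string,
    nullifier_hash: record.nullifier_hash as string,
    proof: record.proof as string,
    verification_level: record.verification_level as string,
    // The signal is optional, so default to an empty string like the route does.
    signal: typeof record.signal === "string" ? record.signal : "",
  }
}

// Placeholder values, not real proof data.
const parsed = parseVerifyBody({
  merkle_root: "0x01",
  nullifier_hash: "0x02",
  proof: "0x03",
  verification_level: "device",
})
const rejected = parseVerifyBody({ proof: "0x03" }) // null: fields are missing
```

In the route, a null result could be answered with a 400 response instead of calling the World ID endpoint.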

User lookup

We have an optional route at /api/get-user that calls the Airstack API with the user’s address to find out if they hold any web3 social accounts. If they do, we can address them by their on-chain social profile name.

//  app/api/get-user/route.ts

import { NextRequest, NextResponse } from 'next/server'
import { init, fetchQuery } from "@airstack/node";

init(process.env.AIRSTACK_API_KEY || '')

export async function POST(req: NextRequest) {
  try {
    const { address } = await req.json()
    const query = `
      query GetAllSocials {
        Socials(
          input: {filter: {userAssociatedAddresses: {_eq: "${address}"}}, blockchain: ethereum}
        ) {
          Social {
            blockchain
            dappName
            profileName
            userAssociatedAddresses
            userId
            userCreatedAtBlockTimestamp
          }
        }
      }
    `
    const { data, error } = await fetchQuery(query)
    return NextResponse.json({ data, error})
  } catch (error) {
    console.log('error', error)
    return NextResponse.json({ error})
  }
}
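Because the route interpolates `address` directly into the GraphQL query string, it is worth checking that the value looks like an Ethereum address first. A minimal sketch: `isEthAddress` is a hypothetical helper, and it checks the format only (0x plus 20 bytes of hex), not the EIP-55 checksum.

```typescript
// 0x followed by exactly 40 hexadecimal characters.
const ETH_ADDRESS_PATTERN = /^0x[0-9a-fA-F]{40}$/

// Type guard: true only for strings that have the shape of an Ethereum address.
function isEthAddress(value: unknown): value is string {
  return typeof value === "string" && ETH_ADDRESS_PATTERN.test(value)
}

const wellFormed = isEthAddress("0x" + "ab".repeat(20))              // true
const injected = isEthAddress('0x123"}}) { Social { userId } } #')   // false
const notAString = isEthAddress(undefined)                           // false
```

In the route, you could return a 400 response when `isEthAddress(address)` is false, before building the query.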

Building the UI

We’re going to build the UI with some nice features like a light/dark mode as well as responsive styling using Tailwind breakpoints. Bear with me, as we will be working across quite a few files.

components/theme-provider.tsx

This lets us set up light / dark theming.

"use client"
 
import * as React from "react"
import { ThemeProvider as NextThemesProvider } from "next-themes"
import { type ThemeProviderProps } from "next-themes/dist/types"
 
export function ThemeProvider({ children, ...props }: ThemeProviderProps) {
  return <NextThemesProvider {...props}>{children}</NextThemesProvider>
}

components/mode-toggle.tsx

Lets us toggle between light and dark mode. This button will live in our nav.

"use client"

import * as React from "react"
import { MoonIcon, SunIcon } from "@radix-ui/react-icons"
import { useTheme } from "next-themes"

import { Button } from "@/components/ui/button"


export function ModeToggle() {
  const { setTheme, theme } = useTheme()
  const nextTheme = theme === "light" ? "dark" : "light"

  return (
    <Button onClick={() => setTheme(nextTheme)} variant="outline" size="icon" className="rounded-full">
      {
        theme === 'light' ? <SunIcon className="h-[1.2rem] w-[1.2rem] rotate-0 scale-100 transition-all dark:-rotate-90 dark:scale-0" /> : <MoonIcon className="absolute h-[1.2rem] w-[1.2rem] rotate-90 scale-0 transition-all dark:rotate-0 dark:scale-100" />
      } 
    </Button>
  )
}

components/connectkit-provider.tsx

This is where we configure Wagmi and ConnectKit

'use client'
import { WagmiProvider, createConfig } from "wagmi";
import { ConnectKitProvider as Provider, getDefaultConfig } from "connectkit";
import { QueryClient, QueryClientProvider } from "@tanstack/react-query";

const config = createConfig(
  getDefaultConfig({
    walletConnectProjectId: process.env.NEXT_PUBLIC_WALLETCONNECT_PROJECT_ID || "",
    appName: "WorldCoin ID Connect",
  }),
);

const queryClient = new QueryClient();

export const ConnectKitProvider = ({ children }: { children: any }) => {
  return (
    <WagmiProvider config={config}>
      <QueryClientProvider client={queryClient}>
        <Provider>
          {children }
        </Provider>
      </QueryClientProvider>
    </WagmiProvider>
  );
};

app/context.ts

This is the global state that we’ll be using in our app. This includes the user’s wallet address, social profile, and a function to set their profile (all optional as they can opt for only World ID verification).

import { createContext } from 'react'

export const AppContext = createContext({
  address: '',
  profile: null,
  setProfile: () => null,
})

app/context-provider.tsx

This is the component that allows us to update and manage the global state configured in app/context.ts

'use client'
import { AppContext } from './context'
import { useAccount } from "wagmi"
import { useState, useEffect } from 'react'

export function ContextProvider ({ children }) {
  const { address } = useAccount()
  const [profile, setProfile] = useState<any>(null)
  const [verified, setVerified] = useState(false)

  useEffect(() => {
    if (address) fetchProfile()
  }, [address])

  async function fetchProfile() {
    try {
     const response = await fetch('/api/get-user', {
      method: 'POST',
      body: JSON.stringify({ address }),
     }).then((res) => res.json())
     if (response?.data?.Socials?.Social) {
        setProfile(response.data.Socials.Social[response.data.Socials.Social.length - 1])
     } else {
      setProfile({
        profileName: address
      })
     }
    } catch (err) {
      console.log('error fetching profile', err)
    }
  }
  
  return (
    <AppContext.Provider
      value={{
        address,
        profile,
        setProfile,
        setVerified,
        verified
      } as any }
    >
      {children}
    </AppContext.Provider>
  )
}
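The profile selection inside `fetchProfile` can be pulled out into a pure function, which makes the fallback behaviour easier to see and test. This is a sketch: `pickProfile` and the `Social` type are hypothetical names, with a shape based on the fields requested in the Airstack query. Unlike the code above, it also falls back to the address when the list is present but empty.

```typescript
// Subset of the fields requested in the Airstack query (hypothetical type name).
interface Social {
  profileName: string
  dappName?: string
}

// Picks the last social profile, or falls back to showing the wallet address.
function pickProfile(
  socials: Social[] | null | undefined,
  address: string
): { profileName: string } {
  if (socials && socials.length > 0) {
    return socials[socials.length - 1]
  }
  return { profileName: address }
}

// Placeholder data for illustration.
const fromSocials = pickProfile(
  [{ profileName: "alice.lens", dappName: "lens" }, { profileName: "alice", dappName: "farcaster" }],
  "0x1234"
)
const fromAddress = pickProfile(null, "0x1234")
```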

app/nav.tsx

This is the navigation bar.

'use client'
import Image from 'next/image'
import Link from 'next/link'
import { ModeToggle } from '@/components/mode-toggle'
import { AppContext } from './context'
import { useContext, useState, useEffect } from 'react'
import { Button } from '@/components/ui/button'
import { useDisconnect } from 'wagmi'
import { GitHubLogoIcon } from '@radix-ui/react-icons'
export function Nav() {
  const [isClient, setIsClient] = useState(false)
  const { address, setProfile, setVerified, verified } = useContext<any>(AppContext)
  const { disconnect } = useDisconnect()
  useEffect(() => {
    setIsClient(true)
  }, [])
  if (!isClient) return null
  return (
    <div
      className="flex border-b px-6 py-3"
    >
      <div className='flex flex-1 items-center'>
        <Link href="/">
          <div className='flex items-center'>
            <Image
              src="/worldcoin.svg"
              width={40}
              className="dark:invert"
              height={40}
              alt="Worldcoin Logo"
            />
              <p className='md:ml-1 text-sm'>
              <span className='font-bold'>World</span> Auth
            </p>
          </div>
        </Link>
        <Link
          href="https://worldcoin.org/world-id"
          target="_blank"
          rel="noopener noreferrer"
        >
          <p
            className='
            hover:text-foreground md:ml-8
            ml-5 text-sm text-muted-foreground
            '
          >
            Learn More
          </p>
        </Link>
        <Link
          href="https://github.com/dabit3/worldid-auth"
          target='_blank'
          rel="noopener noreferrer"
        >
          <GitHubLogoIcon
            className="md:ml-8 ml-5 hover:text-foreground text-muted-foreground"
          />
        </Link>
      </div>
      <div className='flex items-center'>
        {
          (verified || address) && (
            <Button onClick={
              () => {
                disconnect()
                setProfile(null)
                setVerified(false)
              }
            } className='text-xs' variant='link'>
              Disconnect
            </Button>
          )
        }
        <ModeToggle />
      </div>
    </div>
  )
}

For the WorldCoin logo, create worldcoin.svg in the public folder and add this:

<?xml version="1.0" encoding="utf-8"?>
<!-- Generator: Adobe Illustrator 26.4.1, SVG Export Plug-In . SVG Version: 6.00 Build 0)  -->
<svg version="1.1" id="katman_1" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink" x="0px" y="0px"
	 viewBox="0 0 445.2 315.2" style="enable-background:new 0 0 445.2 315.2;" xml:space="preserve">
<path
class="st0" d="M327.6,115.2c-3-7.5-6.8-14.6-11.3-21.3c-20.3-30-54.7-49.7-93.6-49.7c-62.4,0-112.9,50.6-112.9,112.9
	c0,62.4,50.6,113,112.9,113c39,0,73.3-19.7,93.6-49.7c4.5-6.6,8.2-13.7,11.3-21.2c5.2-13,8.1-27.2,8.1-42
	C335.6,142.4,332.8,128.2,327.6,115.2z M312.5,145.7H183.2c2-7,5.7-13.2,10.7-18.1c7.6-7.6,18.1-12.3,29.7-12.3H303
	C307.9,124.6,311.1,134.8,312.5,145.7z M222.1,66.1c25.7,0,49,10.7,65.6,27.9h-61.3c-17.5,0-33.3,7.1-44.7,18.5
	c-8.9,8.9-15.1,20.3-17.4,33.2h-32.5C137.4,100.8,175.7,66.1,222.1,66.1z M222.1,248.4c-46.4,0-84.7-34.7-90.4-79.6h32.5
	c5.4,29.4,31.2,51.7,62.2,51.7h61.3C271.2,237.7,247.9,248.4,222.1,248.4z M223.6,199.3c-19.2,0-35.4-12.9-40.4-30.5h129.3
	c-1.4,10.9-4.7,21.1-9.5,30.5H223.6z"/>
</svg>

app/layout.tsx

This is the component that configures the nav, global state, and providers we’ve created and passes everything down to be available in our routes. You can think of this as the entrypoint to our app.

import type { Metadata } from 'next'
import { Inter } from 'next/font/google'
import { Nav } from './nav'
import { ConnectKitProvider } from '@/components/connectkit-provider'
import { ThemeProvider } from "@/components/theme-provider"
import { ContextProvider } from './context-provider'
import './globals.css'

const inter = Inter({ subsets: ['latin'] })
export const metadata: Metadata = {
  title: 'World ID Auth',
  description: 'Example of World ID device authentication',
}

export default function RootLayout({
  children,
}: {
  children: React.ReactNode
}) {
  return (
    <html lang="en" suppressHydrationWarning>
      <body className={inter.className} style={{ height: 'calc(100vh - 60px)' }}>
        <ThemeProvider
          attribute="class"
          defaultTheme="dark"
          enableSystem
          disableTransitionOnChange
        >
          <ConnectKitProvider>
            <ContextProvider>
              <Nav />
              {children}
            </ContextProvider>
          </ConnectKitProvider>
        </ThemeProvider>
      </body>
    </html>
  )
}

app/page.tsx

The last component we need to set up is the main view, located at app/page.tsx.

This is a large file, so you can also view the code here on GitHub.

This component is the only route we’ll be rendering in our app, and contains all of the functionality we’ll need to make this work.

'use client'
import { CubeIcon } from '@radix-ui/react-icons'
import { Button } from '@/components/ui/button'
import { Separator } from '@/components/ui/separator'
import { useModal } from "connectkit"
import { useAccount } from "wagmi";
import { useState, useEffect, useContext, useRef } from 'react'
import { IDKitWidget, VerificationLevel } from '@worldcoin/idkit'
import { Toaster } from "@/components/ui/sonner"
import { toast } from "sonner"
import { AppContext } from './context'
import Link from 'next/link'

export default function Home() {
  const { address } = useAccount()
  const [isLoaded, setIsLoaded] = useState(false)
  const { setOpen } = useModal()
  const { profile, verified, setVerified } = useContext<any>(AppContext)
  const buttonRef = useRef<any>(null)

  useEffect(() => {
    setIsLoaded(true)
  }, [])

  useEffect(() => {
    if (address && !verified) {
      buttonRef.current?.click()
    }
  }, [address])

  async function handleVerify(data: any) {
    data.signal = address
    const response = await fetch('/api/verify', {
      method: 'POST',
      headers: {
        'Content-Type': 'application/json'
      },
      body: JSON.stringify(data)
    }).then(res => res.json())
    if (response.code === 'success') {
      setVerified(true)
      toast("Successfully authenticated with World ID.")
    } else {
      toast("Authentication failed with World ID.")
      console.log('error:', response.wldResponse)
    }
  }
  if (!isLoaded) return null

  return (
    <main className="
    p-4 sm:p-12 flex flex-1 h-full">
      <div className="
      rounded-xl border flex-col md:flex-row flex flex-1
      ">
       <div
       className='
       p-6 sm:p-8 rounded-tl-xl rounded-tr-xl md:rounded-tr-none
       md:rounded-tl-xl md:rounded-bl-xl flex flex-1 bg-secondary-100 bg-zinc-900'
       >
        <div className='flex items-start flex-col'>
          <div className='flex justify-center flex-1 '>
            <CubeIcon
             className='w-7 h-7 text-white' />
            <p className='
            text-white
            ml-3 text-lg'>Acme Inc</p>
          </div>
          <div>
           <p className='text-white'>
           Sybil-resistant identity verification powered by <Link rel='noopener noreferrer' target="_blank" href="https://docs.worldcoin.org/">World ID</Link>.
           </p>
          </div>
        </div>
       </div>
       {
        verified && (
          <div
          className='
          border-l items-center justify-center
          flex flex-1 flex-col p-6 sm:p-8 '
          >
            <p>
              🎉 &nbsp;Successfully authenticated with World ID.
            </p>
            {
              address && profile && (
                <p className='text-sm mt-3 text-muted-foreground'>
                  welcome, {profile.profileName}
                </p>
              )
            }
          </div>
        )
       }

      {
        !verified && (
          <div
          className='
          md:rounded-bl-none rounded-bl-xl border-l
          items-center justify-center flex flex-1 flex-col p-6 sm:p-8 '
          >
            <div className='flex flex-col items-center'>
              <p className='font-medium text-2xl'>Login or create account</p>
              <p className='text-sm text-muted-foreground mt-2'>
              Connect and verify with World ID.
              </p>
            </div>
            <div className='mt-5 flex flex-col'>
              <IDKitWidget
                app_id={`app_${process.env.NEXT_PUBLIC_WLD_APP_ID}`}
                action={process.env.NEXT_PUBLIC_WC_ACTION || ''}
                onSuccess={(message) => console.log(message)}
                handleVerify={handleVerify}
                signal={address}
                verification_level={VerificationLevel.Device}
              >
                {({ open }) => (
                  <Button
                    ref={buttonRef}
                    className='w-[290px] sm:w-[320px] '
                    onClick={open}
                  >Verify with World ID</Button>
                )}
              </IDKitWidget>
              {
                !address && (
                  <>
                    <div className='relative py-4'>
                      <Separator
                        className='
                          mt-6 absolute inset-0 flex items-center
                        '
                      />
                      <div className='
                       relative flex justify-center text-xs uppercase
                      '>
                        <p className='
                          bg-background px-2 text-muted-foreground
                        '>
                          or
                        </p>
                      </div>
                    </div>
                    <Button
                    onClick={() => setOpen(true)}
                    variant="outline" className='
                    w-[290px] sm:w-[320px] 
                    rounded-lg'>
                      Verify with Wallet & World ID
                    </Button>
                  </>
                )
              }
              <Link
                href="https://apps.apple.com/no/app/world-app-worldcoin-wallet/id1560859847"
                target='_blank'
                rel='noopener noreferrer'
              >
                <p className='
                  text-muted-foreground text-xs mt-4 text-center
                '>
                  Get the WorldID App -&gt;
                </p>
              </Link>
            </div>
          </div>
        )
      }
      </div>
      <Toaster />
    </main>
  )
}

The main thing to call out in the above code is the IDKitWidget component.

This component renders the World ID verification modal and wires it up to our server. Look at the handleVerify function to see how the API call and its response are handled.
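As I understand IDKit's documentation, throwing inside `handleVerify` makes the widget show an error instead of its success screen, whereas the code above only shows a toast. A possible variation, assuming that behaviour: `assertVerified` is a hypothetical helper that turns the `/api/verify` response into either a normal return or an exception.

```typescript
// Shape of the JSON returned by our /api/verify route.
interface VerifyResponse {
  code: "success" | "failure"
  wldResponse?: unknown
}

// Hypothetical helper: returns normally on success, throws on failure.
function assertVerified(response: VerifyResponse): void {
  if (response.code !== "success") {
    throw new Error("World ID verification failed")
  }
}
```

Inside `handleVerify`, you could call `assertVerified(response)` before `setVerified(true)`, so a failed verification stops there.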

Testing it out

That’s it! You should be able to test it out by running npm run dev.

View a live demo here, and the final codebase here.

Introducing React Native AI

React Native AI is a full stack framework for building cross-platform mobile AI apps

Nader Dabit

Nov 27, 2023

React Native AI is a full stack framework for building cross-platform mobile AI apps supporting LLM real-time / streaming text and chat UIs, image services + natural language to image with multiple models, and image processing.

Check out the social announcement here, and the repo here. Please give it a ⭐️ if you think it’s cool!
Video tutorial is also now available here.

I’ve spent the past few months building and launching various AI experiments and products like RoamAround ($10M valuation, 7 million+ users) and AI Buddy ($1,000+ MRR, 2,000+ users).

During that time I’ve learned a lot about how to piece together everything needed to build and launch successful AI apps, lately with a focus on mobile and launching to the App stores.

What I realized was that the base needed to build most of my ideas was the same, but experimenting took a long time because I needed to rebuild these basic components over and over.

React Native AI came out of my need to have a basic, configurable framework to easily spin up new ideas without having to rebuild all of the basic components from scratch.

Some of these components and features:

Streaming / real-time responses. Handling and debugging these on both the client and the server can be time-consuming.
LLM normalization. The various LLM providers respond in different formats. The idea is to handle these on the server and send back to the client with a consistent interface.
OpenAI Assistants. OpenAI Assistants provide a powerful new way to build on LLMs, but the API for doing so is more complex than using a basic LLM. There needs to be a string of API calls and state management for this to work. Having this all set up and ready to use will save me a lot of time in the future.
Image processing. Dealing with image uploads and downloads, saving to disk on the server and saving to the device, can be complicated and time-consuming to debug. Having a plug-and-play interface and utility functions speeds everything up.
Chat UI. Rebuilding components like buttons, text inputs, and chat interfaces is time-consuming and tedious. Having a pre-built UI and components for these repeatable UI elements saves a lot of time.
Theming / styling. React Native AI comes with 5 prebuilt themes and makes swapping themes out very simple, by adding just a few lines of code. When launching your app, you can ship with just a single theme or give users the option to switch between themes.
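As an illustration of the LLM normalization idea from the list above, the server can map each provider's streaming chunk onto one small shape before forwarding it to the client. The chunk shapes below are simplified assumptions about what each provider sends, and `normalizeChunk` is a hypothetical function, not React Native AI's actual code.

```typescript
// Simplified, assumed chunk shapes. Real provider payloads carry more fields.
type ProviderChunk =
  | { provider: "openai"; chunk: { choices: { delta: { content?: string } }[] } }
  | { provider: "anthropic"; chunk: { type: string; delta?: { text?: string } } }
  | { provider: "cohere"; chunk: { event_type: string; text?: string } }

// The single shape the client receives, whichever provider produced it.
interface NormalizedChunk {
  text: string
}

function normalizeChunk(input: ProviderChunk): NormalizedChunk {
  switch (input.provider) {
    case "openai":
      return { text: input.chunk.choices[0]?.delta.content ?? "" }
    case "anthropic":
      // Only text delta events carry text. Other event types map to an empty string.
      return { text: input.chunk.type === "content_block_delta" ? (input.chunk.delta?.text ?? "") : "" }
    case "cohere":
      return { text: input.chunk.event_type === "text-generation" ? (input.chunk.text ?? "") : "" }
  }
}

const fromOpenAI = normalizeChunk({ provider: "openai", chunk: { choices: [{ delta: { content: "Hel" } }] } })
const fromAnthropic = normalizeChunk({ provider: "anthropic", chunk: { type: "content_block_delta", delta: { text: "lo" } } })
const fromCohere = normalizeChunk({ provider: "cohere", chunk: { event_type: "stream-end" } })
```

The client then only ever appends `text` to the current message, and adding a provider means adding one more case on the server.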

Service providers

React Native AI uses the following services:

OpenAI ChatGPT for 2 types of LLMs as well as OpenAI Assistant, code interpreter, and retrieval
Fal.ai for an array of image models.
Anthropic Claude for 2 types of LLMs
Cohere for 2 types of LLMs including web which gives you access to live, online data
Bytescale for image processing

Other technology

React Native AI is made possible by great open source tech like React Native, Expo, Node.js, and Express.

Getting started

To get started with React Native AI, run the following command:

npx rn-ai

You’ll be prompted to configure your environment variables either at setup or later. You don’t have to set up all of the environment variables. You only need to enable the services you’d like to try out.
