Polka Codes - AI-Powered Coding Assistant

Note This project is under active development. APIs and features may evolve, but we strive to maintain backward compatibility when possible.

Polka Codes is a powerful TypeScript-based AI coding assistant framework that helps developers with task planning, code generation, and more. It uses natural language interactions, a multi-agent system, a command-line interface, and seamless GitHub integration to streamline your development workflow.

✨ Features

📝 Task Planning: Create detailed implementation plans for your tasks using the plan command
💻 Code Generation: Implement features from a plan or a simple description with the code command
🐛 Automated Debugging: Automatically fix failing tests or commands with the fix command
🔍 Code Review: Get AI-powered feedback on your pull requests and local changes with automatic fix application
🤖 AI-Assisted Git Workflow: Generate commit messages and create pull requests with the commit and pr commands
💾 Checkpoint & Resume: Save and restore work state with the checkpoints and continue commands
🎯 Custom Scripts: Define and execute custom TypeScript scripts and shell commands with the run and init commands
🤝 Multi-Agent System: Specialized AI agents (Architect, Coder, etc.) collaborate on complex tasks
🔧 Simple Setup: Quickly initialize your project configuration with init
🔄 GitHub Integration: GitHub Actions integration for PR/issue mentions
📦 Extensible Architecture: Modular design for adding AI providers, tools, and agents
⚡ Type Safety: Fully typed with TypeScript for better DX
🧪 Thoroughly Tested: 800+ tests using bun:test with snapshot testing
🤖 Multiple AI Providers: Anthropic Claude, Google Vertex, DeepSeek, OpenAI, OpenRouter, Ollama
🔌 MCP Support: Consume tools from external MCP servers and expose workflows via MCP server
🚀 Auto-Fix: Automatically detect and fix lint/type errors after code changes

Quick Start

Installation

# Install globally using npm
npm install -g @polka-codes/cli

# Or run directly using npx
npx @polka-codes/cli "your task description"

Example Workflow

Here's an example of a typical development workflow using Polka Codes:

# 1. Create a detailed implementation plan for a task
polka plan --plan-file auth.plan.md "Implement JWT-based auth"

# 2. Implement the feature based on the plan
polka code --file auth.plan.md

# 3. Fix any issues that arise (e.g., failing tests)
polka fix "bun test"

# 4. Commit your changes with an AI-generated message
polka commit -a

# 5. Create a pull request
polka pr

Code Review

The review command provides AI-powered code reviews for your projects. It can be used to review GitHub pull requests or local changes. The command can also attempt to automatically fix the issues it finds. Use the --loop option to have it re-review its own changes until the review is clean or the loop limit is reached.

Reviewing Pull Requests

To review a pull request, use the --pr option with the pull request number. This feature requires the GitHub CLI (gh) to be installed and authenticated.

# Review a pull request by number
polka review --pr 123

# Review and automatically apply feedback in a loop (up to 3 times)
polka review --pr 123 --loop 3

Reviewing Local Changes

To review local changes, run the review command without any arguments. It will automatically detect changes in the following order:

Staged changes (git diff --staged)
Unstaged changes (git diff)

If no local changes are found, it will fall back to reviewing the diff between the current branch and the repository's default branch (e.g., main or master). This also requires gh.

# Review staged or unstaged changes
polka review

# Review and apply feedback
polka review --yes

JSON Output

For programmatic use, you can get the review in JSON format by adding the --json flag.

polka review --json

For more information, see the CLI README.

Custom Scripts

Polka Codes supports defining and executing custom automation scripts in your .polkacodes.yml configuration file. This allows you to create reusable TypeScript scripts and shell commands for common development tasks.

Script Types

You can define four types of scripts:

Simple Shell Command: A quick one-liner
Command with Description: A shell command with metadata
TypeScript Script: An in-process TypeScript file with full access to Polka Codes APIs
Workflow Script: A reference to a workflow YAML file

Configuration

Add scripts to your .polkacodes.yml:

scripts:
  # Simple shell command
  test: bun test

  # Command with description
  lint:
    command: bun run lint
    description: Run linter

  # TypeScript script
  deploy:
    script: .polka-scripts/deploy.ts
    description: Deploy to production
    timeout: 300000  # 5 minutes
    permissions:
      fs: write
      network: true

Running Scripts

# List all available scripts
polka run --list

# Run a specific script
polka run deploy

# Run with arguments
polka run deploy -- --production --force

# Quick execution - runs script directly
polka test  # Runs the 'test' script

Creating TypeScript Scripts

Generate a new script template:

polka init script my-script

This creates .polka-scripts/my-script.ts with a template:

import { code, commit } from '@polka-codes/cli'

export async function main(args: string[]) {
  console.log('Running script: my-script')
  console.log('Arguments:', args)

  // Your automation here
  // await code({ task: 'Add feature', interactive: false })
  // await commit({ all: true, context: 'Feature complete' })

  console.log('Script completed successfully')
}

if (import.meta.main) {
  main(process.argv.slice(2))
}

Script Permissions

TypeScript scripts support declaring permissions (currently advisory for future sandboxing):

scripts:
  risky-script:
    script: .polka-scripts/risky.ts
    permissions:
      fs: write        # read, write, or none
      network: true     # true or false
      subprocess: true  # true or false
    timeout: 60000      # Execution timeout in milliseconds
    memory: 512         # Memory limit in MB (64-8192)

Important: TypeScript scripts currently run in-process with full permissions. Permission declarations are advisory only for future sandboxing implementation.

Built-in Commands

The following command names are reserved and cannot be used for custom scripts:

code, commit, pr, review, fix, plan, workflow, run, init, meta

Checkpoint & Resume

Polka Codes automatically creates checkpoints during agent runs, allowing you to save your work state and resume later.

Listing Checkpoints

View all available checkpoints:

polka checkpoints

Output example:

Found 2 checkpoint(s):

1. feature-complete - Jan 4, 2025, 3:00:00 PM (5m ago)
   Completed: 10 task(s)

2. initial-state - Jan 4, 2025, 2:00:00 PM (1h 5m ago)
   Queue: 15 task(s)
   Next: Implement user authentication

Resume with:
  bun polka continue feature-complete

For more details:

polka checkpoints --verbose

Resuming from Checkpoints

Resume from the most recent checkpoint:

polka continue

Resume from a specific checkpoint:

polka continue feature-complete

The continue command will show you:

What tasks were completed
What tasks failed
What tasks are pending
Instructions to manually restore and continue

Note: Full automatic resume is planned for a future release. Currently, the command provides clear instructions for manual restoration.

Working Directory

Checkpoints are stored in your project's .polka/agent-state/checkpoints/ directory. Each checkpoint contains:

Session ID and timestamp
Task queue status
Completed and failed tasks
Execution history
Metrics and performance data

MCP Integration

Polka Codes supports the Model Context Protocol (MCP) for both consuming external tools and exposing your own workflows as MCP tools.

Consuming MCP Tools

You can configure external MCP servers in your .polkacodes.yml to make their tools available to AI agents:

mcpServers:
  filesystem:
    command: npx
    args: ["-y", "@modelcontextprotocol/server-filesystem", "/allowed/path"]
    tools:
      read_file: true
      write_file: true
    # Optionally configure specific AI provider for tools
    # provider: anthropic
    # model: claude-3-5-sonnet-20241022

  database:
    command: /path/to/custom-server
    env:
      DATABASE_URL: "postgresql://..."

When MCP servers are configured, their tools are automatically exposed to AI agents in the code, fix, plan, and task workflows. Tools are named with the pattern {serverName}/{toolName} (e.g., filesystem/read_file).

Exposing Workflows via MCP Server

Polka Codes can run as an MCP server, exposing your high-level workflows to Claude Code.

Start the MCP server:

polka mcp-server

The server exposes these workflow tools:

code: Execute coding tasks
review: Review code changes
plan: Create implementation plans
fix: Fix failing tests or commands
commit: Generate commit messages

Configuration with Claude Code:

Add the MCP server to your Claude Code configuration:

claude mcp add polka polka mcp-server

Or manually edit ~/.claude/config.json:

{
  "mcpServers": {
    "polka": {
      "command": "polka",
      "args": ["mcp-server"]
    }
  }
}

For detailed MCP usage instructions and configuration options, see MCP_GUIDE.md.

Project Structure

The project is organized as a monorepo with the following packages:

Package	Description
`core`	Core AI services, agent implementations, and tooling.
`cli`	Command-line interface for interacting with AI services.
`cli-shared`	Shared utilities and types for CLI packages.
`github`	GitHub integration, including the GitHub Action.
`runner`	Service for running agents and managing tasks.

Getting Started

Prerequisites

Bun (v1.0.0 or higher)
ripgrep (required for the file search tool)

Development Setup

# Clone the repository
git clone https://github.com/polka-codes/polka-codes.git
cd polka-codes

# Install dependencies
bun install

Available Scripts

bun build: Build all packages.
bun lint: Check for linting and formatting errors.
bun fix: Automatically fix linting and formatting issues.
bun check: Run type checking and linting.
bun typecheck: Run type checking only.
bun test: Run tests across all packages.
bun clean: Remove build artifacts.
bun cli: Run the CLI in development mode.
bun pr: A shortcut for bun cli pr.
bun commit: A shortcut for bun cli commit.
bun codegen: Generate GraphQL types for the GitHub package.

Contributing

If you're contributing to this project, please refer to AGENTS.md for:

Comprehensive project architecture and workflow system documentation
Code conventions and development guidelines
Tool system patterns and multi-agent architecture
Important implementation notes

Configuration

A .polkacodes.yml configuration file can be used to customize the behavior of polka-codes. An example configuration file is provided in the repository as example.polkacodes.yml.

For detailed configuration options, refer to the example file, which includes comprehensive comments for each setting.

Tool Format

You can specify the format for tool integration using the toolFormat option in your .polkacodes.yml file. This setting determines how the AI model interacts with the available tools.

native: Uses the model's native tool-use capabilities via the Vercel AI SDK. More efficient and leads to better results, but not supported by all models. Check your model provider's documentation for compatibility.
polka-codes (default): Uses a custom XML-based format for tool calls. Designed to be compatible with a wide range of models but may consume more tokens compared to the native format.

You can set the toolFormat globally or for specific agents or commands.

Example:

# .polkacodes.yml
toolFormat: "native"

Environment Variables

The following environment variables can be used to configure Polka Codes. Note that command line options take precedence over environment variables, which in turn take precedence over the configuration file.

Variable	Description
`POLKA_API_PROVIDER`	Specify the default AI service provider.
`POLKA_MODEL`	Specify the default AI model to use.
`POLKA_API_KEY`	Default API key for the selected provider.
`POLKA_BUDGET`	Set the budget limit for AI service usage (defaults to 1000).
`ANTHROPIC_API_KEY`	API key for Anthropic Claude service.
`OPENROUTER_API_KEY`	API key for OpenRouter service.
`DEEPSEEK_API_KEY`	API key for DeepSeek service.

License

This project is licensed under the AGPL-3.0 License.

Credits

This project is heavily inspired by the Cline project.

Generated by polka.codes

Name		Name	Last commit message	Last commit date
Latest commit History 934 Commits
.github		.github
.husky		.husky
docs		docs
examples		examples
packages		packages
plans		plans
scripts		scripts
.editorconfig		.editorconfig
.gitattributes		.gitattributes
.gitignore		.gitignore
.polkacodes.yml		.polkacodes.yml
AGENTS.md		AGENTS.md
LICENSE		LICENSE
MCP_GUIDE.md		MCP_GUIDE.md
README.md		README.md
biome.jsonc		biome.jsonc
bun.lock		bun.lock
bunfig.toml		bunfig.toml
ci.polkacodes.yml		ci.polkacodes.yml
example.polkacodes.yml		example.polkacodes.yml
package.json		package.json
tsconfig.json		tsconfig.json

License

polka-codes/polka-codes

Folders and files

Latest commit

History

Repository files navigation

Polka Codes - AI-Powered Coding Assistant

✨ Features

Quick Start

Installation

Example Workflow

Code Review

Reviewing Pull Requests

Reviewing Local Changes

JSON Output

Custom Scripts

Script Types

Configuration

Running Scripts

Creating TypeScript Scripts

Script Permissions

Built-in Commands

Checkpoint & Resume

Listing Checkpoints

Resuming from Checkpoints

Working Directory

MCP Integration

Consuming MCP Tools

Exposing Workflows via MCP Server

Project Structure

Getting Started

Prerequisites

Development Setup

Available Scripts

Contributing

Configuration

Tool Format

Environment Variables

License

Credits

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 162

Packages 0

Uh oh!

Contributors 8

Uh oh!

Languages

Packages