Skip to content

Agent CLI

A collection of local-first, AI-powered command-line agents that run entirely on your machine.

agent-cli logo

What is Agent CLI?

Agent CLI provides a suite of powerful tools for voice and text interaction, designed for privacy, offline capability, and seamless integration with system-wide hotkeys and workflows.

Tip

Short aliases available: You can use agent or ag instead of agent-cli for convenience.

Important

Local and Private by Design

All agents can run 100% locally. Your data—whether from your clipboard, microphone, or files—stays on your machine unless you configure a cloud provider. This keeps workflows private and allows the tools to work offline.

You can optionally configure the agents to use OpenAI/Gemini services.

Quick Demo

Why I built this

I got tired of typing long prompts to LLMs. Speaking is faster, so I built this tool to transcribe my voice directly to the clipboard with a hotkey.

What it does:

  • Voice transcription to clipboard with system-wide hotkeys (Cmd+Shift+R on macOS)
  • Autocorrect any text from your clipboard
  • Edit clipboard content with voice commands ("make this more formal")
  • Runs locally - no internet required, your audio stays on your machine
  • Works with any app that can copy/paste

I use it mostly for the transcribe command when working with LLMs. Being able to speak naturally means I can provide more context without the typing fatigue.

Since then I have expanded the tool with many more features, all focused on local-first AI agents that integrate seamlessly with your system.

Features

Command Description
autocorrect Correct grammar and spelling using a local LLM
transcribe Transcribe audio from your microphone to clipboard
transcribe-daemon Continuous background transcription with VAD
speak Convert text to speech using a local TTS engine
voice-edit Edit clipboard text with voice commands
assistant Wake word-based voice assistant
chat Conversational AI with tool-calling capabilities
rag-proxy Chat with your documents via RAG
memory Long-term memory system for conversations
dev Parallel development with git worktrees and AI coding agents
server Local Whisper ASR server and transcription proxy

Quick Start

Just want the CLI tool?

If you already have AI services running (or plan to use OpenAI):

# Using uv (recommended)
uv tool install agent-cli

# Using pip
pip install agent-cli

Then use it:

agent-cli autocorrect "this has an eror"

Want automatic setup with everything?

# 1. Install agent-cli
uv tool install agent-cli

# 2. Install all required services
agent-cli install-services

# 3. Start all services
agent-cli start-services

# 4. (Optional) Set up system-wide hotkeys
agent-cli install-hotkeys

# 5. Use it!
agent-cli autocorrect "this has an eror"

The setup automatically installs:

  • ✅ Package managers (Homebrew/uv) if needed
  • ✅ All AI services (Ollama, Whisper, TTS, etc.)
  • ✅ The agent-cli tool
  • ✅ System dependencies
  • ✅ Hotkey managers (if using hotkey scripts)

Requirements

  • uv (recommended) or Python 3.11+
  • A microphone (for voice features)
  • Speakers (for text-to-speech features)

Documentation

License

MIT