RamboRogers/MLX-Transcribe

🎤 MLX-Transcribe

MLX-Transcribe Demo

*Press "Control + ~" → Speak → Text appears instantly at your cursor*


🚀 Lightning-fast AI transcription directly at your cursor - The fastest way from voice to text

🎯 What is MLX-Transcribe?

MLX-Transcribe is a native, signed, privacy-first transcription app for macOS that turns your voice into text at your cursor position. It is optimized for Apple Silicon and integrates with MLX-GUI for local processing, which makes it a good fit for developers, writers, and anyone who values speed and privacy.

🔥 Key Features

  • ⚡ Instant Transcription - Press Control + ` and speak - text appears instantly at your cursor
  • 🤖 Local AI First - Seamless MLX-GUI integration with automatic model management
  • 🛡️ Privacy Focused - Your voice never leaves your machine with local processing
  • 🎛️ Multi-API Support - Works with Local MLX, OpenAI, and Groq endpoints
  • ⌨️ Customizable Hotkeys - Configure any key combination that works for you
  • 🎯 Universal Cursor - Works in any app - text editors, browsers, chat apps, terminals
  • 🚀 Menu Bar Native - Lightweight, always-accessible menu bar app
  • 🔄 Auto-Install Models - Automatically downloads and loads Parakeet models via MLX-GUI
  • 🏃‍♂️ Startup Ready - Optional login item for seamless workflow integration

🚀 Quick Install

  1. Download the latest release from Releases
  2. Open the .zip file and drag MLX-Transcribe to Applications
  3. Launch and grant accessibility permissions when prompted

⚙️ Setup

1. 🎤 Microphone Permission

MLX-Transcribe will automatically request microphone access on first launch.

2. 🔓 Accessibility Permission

For text insertion at cursor:

  1. Click "Grant Accessibility" in the menu bar popup
  2. Add MLX-Transcribe to Privacy & Security → Accessibility
  3. Restart the app after granting permission

3. 🤖 AI Configuration

  • Local MLX (Default): Integrates with MLX-GUI - models auto-install
  • OpenAI: Add your API key in settings
  • Groq: Add your Groq API key for fast cloud transcription

🎮 Usage

Basic Transcription

  1. Press Control + ~ (or your custom hotkey)
  2. Speak clearly while you see the red recording indicator
  3. Press Control + ~ again to stop recording
  4. Text appears instantly at your cursor position

Menu Bar Controls

Click the 🎤 icon in your menu bar to:

  • View transcription status
  • Configure hotkeys and API settings
  • Enable "Start on Login"
  • Check permissions and connection status

Pro Tips

  • Best Results: Speak clearly and pause briefly between sentences
  • Quick Stop: Press Control + ~ while recording to stop immediately
  • Works Everywhere: Terminal, VS Code, Slack, browsers, documents
  • Privacy Mode: Use local MLX processing to keep everything offline

🛠️ API Configurations

🏠 Local MLX (Recommended)

Endpoint: http://127.0.0.1:8000/v1/audio/transcriptions
Model: parakeet-tdt-0-6b-v2 (auto-installed)
API Key: Not required

☁️ OpenAI

Endpoint: https://api.openai.com/v1/audio/transcriptions
Model: whisper-1
API Key: Required (sk-...)

⚡ Groq

Endpoint: https://api.groq.com/openai/v1/audio/transcriptions
Model: whisper-large-v3-turbo
API Key: Required (gsk_...)
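
All three configurations point at OpenAI-style `/audio/transcriptions` endpoints, so a single client can target any of them. The sketch below is not taken from the app's source: it assumes each endpoint accepts a multipart form with `model` and `file` fields and returns JSON with a `text` field, as the OpenAI API does. The names `BACKENDS`, `build_request`, and `transcribe` are illustrative.

```python
import json
import os
import urllib.request
import uuid

# Endpoints and model names are copied from the configurations above.
BACKENDS = {
    "local": ("http://127.0.0.1:8000/v1/audio/transcriptions", "parakeet-tdt-0-6b-v2"),
    "openai": ("https://api.openai.com/v1/audio/transcriptions", "whisper-1"),
    "groq": ("https://api.groq.com/openai/v1/audio/transcriptions", "whisper-large-v3-turbo"),
}


def build_request(backend, audio_bytes, filename="clip.wav", api_key=None):
    """Build a multipart/form-data POST for the chosen backend (hypothetical helper)."""
    url, model = BACKENDS[backend]
    boundary = uuid.uuid4().hex
    body = b"".join([
        # Form field: model name
        f'--{boundary}\r\nContent-Disposition: form-data; name="model"\r\n\r\n{model}\r\n'.encode(),
        # Form field: the audio file itself
        (f'--{boundary}\r\nContent-Disposition: form-data; name="file"; '
         f'filename="{filename}"\r\nContent-Type: application/octet-stream\r\n\r\n').encode(),
        audio_bytes,
        f"\r\n--{boundary}--\r\n".encode(),
    ])
    headers = {"Content-Type": f"multipart/form-data; boundary={boundary}"}
    if api_key:  # the local endpoint does not require a key
        headers["Authorization"] = f"Bearer {api_key}"
    return urllib.request.Request(url, data=body, headers=headers, method="POST")


def transcribe(backend, path, api_key=None):
    """Send an audio file and return the transcript (assumes an OpenAI-style JSON reply)."""
    with open(path, "rb") as f:
        req = build_request(backend, f.read(), os.path.basename(path), api_key)
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)["text"]
```

With MLX-GUI running, `transcribe("local", "clip.wav")` would be the local call; the cloud backends need `api_key` set to your own key.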

🖥️ System Requirements

  • macOS: 12.0 (Monterey) or later
  • Architecture: Intel x86_64 or Apple Silicon (arm64)
  • Memory: 4GB RAM minimum, 8GB recommended
  • Storage: 50MB for app, additional space for local models
  • For Local MLX: MLX-GUI installed and running
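
To check the OS version and architecture requirements from a terminal, something like the following may help. It is a rough sketch that uses only Python's standard `platform` module; `meets_requirements` is a hypothetical helper, and it does not check memory, storage, or whether MLX-GUI is running.

```python
import platform


def meets_requirements(mac_version, machine):
    """Return True for macOS 12.0 or later on x86_64 or arm64."""
    if not mac_version:  # platform.mac_ver() gives "" on non-macOS systems
        return False
    major = int(mac_version.split(".")[0])
    return major >= 12 and machine in ("x86_64", "arm64")


if __name__ == "__main__":
    version = platform.mac_ver()[0]  # release string such as "14.5"
    print(meets_requirements(version, platform.machine()))
```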

🔧 Advanced Configuration

Custom Hotkeys

Popular combinations:

  • Control + ~ (Default) - Great for developers
  • Shift + Tab - Alternative option
  • Option + Space - Quick access
  • F5 - Function key option
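
As an illustration of how combinations like these break down into modifiers plus a key, here is a small parser. It is a sketch only: the app's actual hotkey representation is not documented here, and `MODIFIERS` and `parse_hotkey` are hypothetical names.

```python
# Modifier names assumed for this example; the app may support others.
MODIFIERS = {"control", "shift", "option", "command"}


def parse_hotkey(combo):
    """Split a string such as "Control + ~" into (modifiers, key)."""
    parts = [p.strip() for p in combo.split("+")]
    *mods, key = parts  # the last element is the key, the rest are modifiers
    mods = frozenset(m.lower() for m in mods)
    if not key or mods - MODIFIERS:
        raise ValueError(f"invalid hotkey: {combo!r}")
    return mods, key
```

A bare function key such as `F5` parses to an empty modifier set, while `Shift + Tab` yields the single modifier `shift` with the key `Tab`.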

Keyboard Shortcuts

  • Start/Stop Recording: Your configured hotkey (default: Control + ~)
  • Force Stop: Control + ~ (while recording)
  • Open Menu: Click menu bar icon

🤝 Contributing

We welcome contributions! Here's how you can help:

  1. 🐛 Report Issues: Found a bug? Open an issue
  2. 💡 Feature Requests: Have an idea? We'd love to hear it!
  3. 🔧 Code Contributions: Fork, branch, commit, and PR
  4. 📖 Documentation: Help improve our docs and examples
  5. ⭐ Star the Repo: Show your support!

📊 Performance

| Metric   | Local MLX     | OpenAI        | Groq            |
|----------|---------------|---------------|-----------------|
| Latency  | 🟢 ~2-4s      | 🟡 ~3-8s      | 🟢 ~1-3s        |
| Privacy  | 🟢 100% Local | 🔴 Cloud      | 🔴 Cloud        |
| Accuracy | 🟢 Excellent  | 🟢 Excellent  | 🟢 Excellent    |
| Cost     | 🟢 Free       | 🟡 $0.006/min | 🟡 $0.00011/min |
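
To put the Cost row in context, a rough monthly estimate can be computed from the per-minute rates. The rates below are copied from the table and may not match current provider pricing; `monthly_cost` is an illustrative helper.

```python
# Per-minute rates in USD, taken from the Cost row above.
RATE_PER_MINUTE = {"local": 0.0, "openai": 0.006, "groq": 0.00011}


def monthly_cost(backend, minutes_per_day, days=30):
    """Estimated cost in USD for a month of dictation."""
    return RATE_PER_MINUTE[backend] * minutes_per_day * days


if __name__ == "__main__":
    for name in RATE_PER_MINUTE:
        print(f"{name}: ${monthly_cost(name, 20):.2f}")
```

At 20 minutes of dictation per day, that works out to about $3.60 per month on OpenAI and about $0.07 on Groq, while local processing stays free.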

📜 License

Freeware License - see LICENSE file for details.

🙏 Acknowledgments

  • Built with ❤️ by Matthew Rogers
  • Powered by MLX-GUI for local AI processing
  • Uses Apple's Speech Recognition framework
  • Inspired by the need for privacy-first transcription tools

Made with 🎤 and ⚡ for the developer community

MLX-Transcribe - Where voice meets code at the speed of thought
