Connect Claude, Cursor, Windsurf & AI Assistants to Your Speak AI Workspace
81 tools, 5 resources, 3 prompts, and 26 CLI commands. Transcribe, analyze, search, and manage media at scale from any MCP-compatible AI assistant or your terminal. Free and open source.
Get started in 2 minutes
Auto-setup detects your installed MCP clients and configures them. Or connect via remote URL with zero installation.
Get your API key
Sign up at app.speakai.co and copy your API key from account settings. Full access during the trial.
Install and configure
Install globally via npm, then run the auto-setup wizard. It detects Claude Desktop, Cursor, Windsurf, VS Code, and other MCP clients on your machine.
Start using 81 tools
Ask your AI assistant to transcribe a file, search across your media library, or pull insights from a meeting. Or use the CLI to script and automate from your terminal.
npm install -g @speakai/mcp-server
# Auto-detect and configure MCP clients
speakai-mcp init
# Or use the CLI directly
speakai-mcp config set-key
speakai-mcp upload ./meeting-recording.mp4 –wait
speakai-mcp ask “What were the action items?”
81 tools across 12 categories
Every Speak AI capability is available as an MCP tool. Your AI assistant can chain tools together for multi-step workflows: upload a recording, wait for processing, pull the transcript, extract action items, and export to PDF in a single conversation.
14 tools
Media
Upload from URL or local file, transcribe, get insights, export captions, update metadata, reanalyze with latest models. Upload-and-analyze in one call.
12 tools
Magic Prompt / AI Chat
Ask AI questions about any media, folder, or your entire workspace. Get chat history, manage favorites, export answers, and view usage statistics.
11 tools
Folders & Views
Create, clone, and organize folders. Save custom views with filters and sorting. Manage your media library structure programmatically.
10 tools
Recorder & Survey
Create embeddable recorders and surveys. Manage questions, branding, permissions. List submissions and generate shareable public URLs.
5 tools
Automations
Create and manage automation rules. Toggle automations on or off. Trigger workflows based on events in your workspace.
4 tools
Clips
Create highlight clips from time ranges across media files. Update, tag, and share clips. Pull the best moments from long recordings.
4 tools
Custom Fields
Define and manage custom metadata fields. Batch update fields across multiple media items for structured data workflows.
4 tools
Webhooks
Create webhook endpoints for real-time event notifications. Update, list, and manage webhooks for event-driven integrations.
4 tools
Meeting Assistant
Schedule the AI assistant to join meetings. List events, remove from active meetings, or cancel scheduled sessions. Works with Zoom, Google Meet, Teams.
4 tools
Media Embed
Create embeddable player widgets for your website. Update settings, check status, and get iframe URLs for any media file.
4 tools
Text Notes
Create text notes for AI analysis. Get NLP insights on any text. Re-run analysis with latest models. Update note content to trigger re-analysis.
5 tools
Exports, Search & Stats
Export as PDF, DOCX, SRT, VTT, TXT, CSV, or Markdown. Batch export with optional merge. Deep search across transcripts and insights. Workspace statistics and supported languages.
26 CLI commands for scripting and automation
Everything the MCP server can do, the CLI can do from your terminal. Upload files, pull transcripts, search across your library, and pipe results to other tools. Every command supports --json for scripting.
Media management
upload, list-media, get-transcript, get-insights, status, export, update, delete, favorites, captions, reanalyze
AI and search
ask lets you query media, folders, or your whole workspace with AI. chat-history lists past conversations. search does full-text search across all transcripts and insights.
Organization and more
list-folders, create-folder, clips, clip, stats, languages, schedule-meeting, create-text. Plus config and init for setup.
speakai-mcp upload ./interview.mp3 -n “Q1 Interview” –wait
# Get plain-text transcript
speakai-mcp transcript abc123 –plain > meeting.txt
# Ask AI about a specific recording
speakai-mcp ask “What were the action items?” -m abc123
# Search all transcripts
speakai-mcp search “pricing concerns” –from 2026-01-01
# Export as PDF with speaker names
speakai-mcp export abc123 -f pdf –speakers
# List videos as JSON for piping
speakai-mcp ls –type video –json | jq ‘.mediaList[].name’
Works with every major AI platform
Connect via remote URL (no installation) or install locally via npm. The auto-setup wizard detects your clients and configures them automatically.
Claude (Web, Desktop, Code)
Connect via remote MCP URL on claude.ai or install locally for Claude Desktop and Claude Code. Full access to all 81 tools through natural conversation.
ChatGPT
Add the Speak AI MCP server as a remote connector in ChatGPT settings. Use all transcription, analysis, and media management tools directly in chat.
Cursor & Windsurf
Add Speak AI to your coding IDE. Transcribe meeting recordings, pull insights, and search across your media library without leaving your editor.
VS Code
Use with VS Code MCP extensions. Access your transcription and analysis workspace alongside your code. Same 81 tools available.
Any MCP client (STDIO)
Standard STDIO transport works with any tool that supports the Model Context Protocol. Configure with the npm package path and your API key.
Terminal / Scripts
The 26 CLI commands work in any terminal. Pipe output to other tools, use in shell scripts, cron jobs, or CI/CD pipelines. Every command supports --json output.
What you can do
Example workflows that combine multiple tools. Your AI assistant chains these automatically in a single conversation.
Transcribe and analyze a meeting
“Upload and transcribe this recording.” The assistant calls upload_and_analyze, waits for processing, and returns the full transcript with speaker labels, sentiment, action items, and key topics.
Research across your library
“What themes came up across all our customer interviews this month?” The assistant searches your media, retrieves insights from each file, and synthesizes patterns across 12+ recordings in one response.
Join a meeting and summarize
“Join my 2pm Zoom call and send me a summary with action items.” The assistant schedules the meeting bot, then pulls the transcript and insights after the meeting ends.
Build a weekly brief
“Prepare a brief from all meetings in the last week.” The assistant lists recent media, pulls insights from each, and creates a consolidated brief with decisions, action items grouped by owner, and follow-ups.
Create highlight clips
“Find the best quotes about pricing from the last 5 interviews and create clips.” The assistant searches transcripts, identifies time ranges, and creates shareable clips from each recording.
Batch export for a report
“Export all recordings from the Q1 Research folder as PDFs with speaker names.” The assistant lists folder contents and runs batch export, merging results into downloadable files.
How Speak AI compares
Most transcription tools offer 5-15 MCP tools focused on meeting notes. Speak AI provides a complete media intelligence platform with 81 tools, a full CLI, and capabilities no competitor offers.
| Capability | Speak AI | Typical Meeting MCP |
|---|---|---|
| MCP tools | 81 | 5-15 |
| CLI commands | 26 | None |
| MCP resources | 5 | 0-1 |
| Built-in prompts | 3 | 0 |
| NLP analytics (sentiment, keywords, entities) | Yes, built in | No |
| Multi-model AI Chat | Claude, Gemini, GPT | Single model or none |
| Cross-file analysis | Search and query across entire library | Single meeting only |
| Upload from URL or local file | Both | URL only or none |
| Export formats | PDF, DOCX, SRT, VTT, TXT, CSV, Markdown | Text only |
| Embeddable recorder and surveys | Yes, 10 tools | No |
| Automations and webhooks | Yes, 9 tools | No |
| Meeting bot scheduling | Yes, 4 tools | Limited |
| Languages | 70+ | 10-30 |
| Open source | MIT license | Varies |
Resources and built-in prompts
MCP resources provide direct data access without tool calls. Built-in prompts run multi-step workflows with a single command.
5 MCP Resources
Direct access to your media library, folders, supported languages, and per-file transcripts and insights. Clients can read these URIs without making explicit tool calls.
analyze-meeting prompt
Upload a recording and get a complete analysis: transcript, insights, action items, and key takeaways. One prompt runs the full pipeline.
research-across-media prompt
Search for themes, patterns, or topics across multiple recordings or your entire library. Specify a topic and optional folder to scope the research.
meeting-brief prompt
Prepare a brief from recent meetings. Pulls transcripts, extracts decisions, and summarizes open items. Defaults to the last 7 days.
Why Speak AI built the most comprehensive MCP server for media intelligence
MCP (Model Context Protocol) is the open standard that lets AI assistants like Claude and ChatGPT connect to external tools and data. Instead of copying and pasting transcripts into a chat window, you give your AI assistant direct access to your entire Speak AI workspace. It can upload files, pull transcripts, run NLP analysis, search across recordings, create clips, and export results, all through natural conversation.
Most transcription platforms that offer MCP servers provide a handful of tools focused on meeting notes. Speak AI takes a fundamentally different approach. With 81 MCP tools across 12 categories, plus 26 CLI commands, the Speak AI MCP server covers every capability of the platform: media management, AI-powered analysis, folder organization, embeddable recorders and surveys, automations, webhooks, meeting bot scheduling, custom fields, and multi-format exports.
The CLI changes how teams work with media data
The 26 CLI commands turn Speak AI into a scriptable media intelligence engine. Upload recordings from a shell script. Pull transcripts and pipe them to other tools. Search across your entire library from the command line. Every command supports --json output for integration with jq, Python scripts, or CI/CD pipelines. Teams use the CLI for batch processing, automated reporting, and building custom workflows that would be impractical through a web interface.
Built for researchers, not just meetings
Meeting transcription is table stakes. Speak AI is built for qualitative researchers, consulting firms, market research teams, and anyone who needs to analyze conversations at scale. The MCP server lets you query across dozens or hundreds of recordings: “What pricing concerns came up in customer interviews this quarter?” or “Compare sentiment across focus groups by region.” Cross-file analysis is the differentiator that single-meeting tools cannot match.
NLP analytics built into every tool
When you upload a recording through the MCP server, Speak AI does not just transcribe it. Every file gets automatic NLP analysis: sentiment scoring, keyword extraction, topic detection, theme identification, and named entity recognition. These structured insights are available as JSON through the get_media_insights tool. Your AI assistant can use them to compare sentiment across recordings, find trending topics, or pull specific data points without re-processing the audio.
Open source and extensible
The Speak AI MCP server is open source under the MIT license. View the full source on GitHub, install from npm, and extend it for your own workflows. The server authenticates using your personal API key and only accesses data in your Speak AI workspace. All communication uses HTTPS encryption. Full API documentation is available at docs.speakai.co.
Frequently asked questions
What is an MCP server?
MCP (Model Context Protocol) is an open standard that lets AI assistants connect to external tools and data. The Speak AI MCP server gives AI assistants like Claude, ChatGPT, Cursor, Windsurf, and VS Code direct access to 81 transcription, analysis, and media management tools in your Speak AI workspace.
Which AI assistants work with the Speak AI MCP server?
Claude (web, Desktop, and Code), ChatGPT, Cursor, Windsurf, VS Code with MCP extensions, and any AI tool that supports the MCP protocol. The auto-setup wizard (speakai-mcp init) detects installed clients and configures them automatically.
Do I need to be a developer to use this?
No. The remote connector setup for Claude.ai and ChatGPT requires no coding or installation. Add the remote MCP URL in settings and authenticate with your API key. The npm install is for local setups and the CLI.
What is the CLI and how is it different from the MCP server?
The CLI provides 26 commands you run directly in your terminal: speakai-mcp upload, speakai-mcp ask, speakai-mcp search, and more. The MCP server provides 81 tools that AI assistants call during conversation. Both are included in the same npm package. Use the CLI for scripting, automation, and quick tasks. Use the MCP server when you want your AI assistant to work with your Speak AI data.
Is my data secure?
Yes. The server authenticates using your personal API key and only accesses data in your Speak AI workspace. All communication uses HTTPS encryption. The server is open source under the MIT license, so you can audit the code on GitHub.
How much does it cost?
The MCP server and CLI are free and open source. You need a Speak AI account to use them. API access is available on all paid plans, and you get full access during the free 7-day trial with no credit card required. See pricing for plan details.
What languages are supported?
Speak AI supports transcription in over 70 languages with automatic language detection. All languages work through both the MCP server and CLI. Speaker diarization, timestamps, and NLP analytics are available across all supported languages.
Can I use both the MCP server and CLI together?
Yes, and this is the recommended approach for power users. Use the MCP server when you want your AI assistant to orchestrate complex workflows through conversation. Use the CLI for quick tasks, batch operations, and scripting. Both share the same API key and access the same workspace data.
Start using Speak AI from your AI assistant or terminal
81 MCP tools, 26 CLI commands, and the most comprehensive media intelligence integration available for AI assistants. Get started in 2 minutes.
Get started free
Create an account, grab your API key, and connect your AI assistant. Full access during the 7-day trial. No credit card required.
View the source and docs
Open source under MIT. Full README with setup guides for every platform, tool reference, and CLI documentation.