Whisper Transcribe Skill

A Claude Code skill for transcribing audio and video files using OpenAI's Whisper with context-grounding from markdown files.

Features

Audio/Video Transcription: Convert media files to text using OpenAI Whisper
Context Grounding: Uses markdown files in the same directory to improve accuracy for technical terms, names, and jargon
Multi-format Support: Works with mp3, wav, m4a, mp4, webm, and more
Cross-platform: Supports macOS (Homebrew) and Linux installations
Automated Workflow: Python script handles the full transcription pipeline

Installation

Quick Install with Skilz (Recommended)

The easiest way to install this skill is using the skilz universal installer:

npx skilz install SpillwaveSolutions_whisper-transcribe/whisper-transcribe

This command automatically downloads and configures the skill for Claude Code.

View on Skilz Marketplace: whisper-transcribe

Manual Installation

Clone the repository to your Claude Code skills directory:

git clone https://github.com/SpillwaveSolutions/whisper-transcribe.git ~/.claude/skills/whisper-transcribe

Prerequisites

After installing the skill, you need to install Whisper and ffmpeg on your system.

macOS (Homebrew)

brew install ffmpeg openai-whisper

Linux

# Install ffmpeg
sudo apt install ffmpeg  # Debian/Ubuntu

# Install Whisper
pip install openai-whisper

Verify Installation

whisper --version
ffmpeg -version

Usage

Basic Transcription

whisper /path/to/audio.mp3 --output_dir /path/to/output

With Context Grounding Script

python scripts/transcribe_with_context.py /path/to/audio.mp3 --model base --language en

The script will:

Find markdown context files in the same directory
Run Whisper transcription
Apply corrections based on context (technical terms, names)
Save both original and grounded transcripts

Model Selection

Model	Speed	Accuracy	RAM Required	Best For
tiny	Fastest	Lower	~1 GB	Quick drafts, testing
base	Fast	Good	~1 GB	General use
small	Medium	Better	~2 GB	Important recordings
medium	Slower	High	~5 GB	Professional transcription
large	Slowest	Highest	~10 GB	Critical accuracy needs

For MacBook Pro with Apple Silicon: small or medium models recommended for best speed/accuracy balance.

Context Files

Create markdown files in the same directory as your audio to improve transcription accuracy.

Example Context File

# Meeting Context

## Speakers
- Richard Hightower (host)
- Jane Smith (engineering lead)

## Technical Terms
- Kubernetes (container orchestration)
- FastAPI (Python web framework)
- AlloyDB (Google Cloud database)

## Acronyms
- CI/CD - Continuous Integration/Continuous Deployment
- PR - Pull Request

See assets/context-template.md for a complete template.

Project Structure

whisper-transcribe/
├── SKILL.md                        # Skill definition
├── README.md                       # This file
├── scripts/
│   └── transcribe_with_context.py  # Automated transcription script
├── references/
│   └── whisper-options.md          # Complete Whisper CLI reference
└── assets/
    └── context-template.md         # Template for context files

Triggers

This skill activates when users mention:

whisper, transcribe, transcription
audio to text, video to text, speech to text
meeting transcript, convert recording
File extensions: .mp3, .wav, .m4a, .mp4, .webm

Troubleshooting

"whisper: command not found"

# macOS
brew install openai-whisper

# Linux
pip install openai-whisper
export PATH="$HOME/.local/bin:$PATH"

"ffmpeg not found"

# macOS
brew install ffmpeg

# Linux
sudo apt install ffmpeg

Out of memory errors

Use a smaller model:

whisper "audio.mp3" --model tiny

Slow transcription

Use tiny or base model for faster results
Ensure correct architecture is being used (Apple Silicon vs Intel)

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

License

MIT

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Whisper Transcribe Skill

Features

Installation

Quick Install with Skilz (Recommended)

Manual Installation

Prerequisites

macOS (Homebrew)

Linux

Verify Installation

Usage

Basic Transcription

With Context Grounding Script

Model Selection

Context Files

Example Context File

Project Structure

Triggers

Troubleshooting

"whisper: command not found"

"ffmpeg not found"

Out of memory errors

Slow transcription

Contributing

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
assets		assets
references		references
scripts		scripts
.gitignore		.gitignore
README.md		README.md
SKILL.md		SKILL.md

Folders and files

Latest commit

History

Repository files navigation

Whisper Transcribe Skill

Features

Installation

Quick Install with Skilz (Recommended)

Manual Installation

Prerequisites

macOS (Homebrew)

Linux

Verify Installation

Usage

Basic Transcription

With Context Grounding Script

Model Selection

Context Files

Example Context File

Project Structure

Triggers

Troubleshooting

"whisper: command not found"

"ffmpeg not found"

Out of memory errors

Slow transcription

Contributing

License

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages