Skip to content

Pedrohgv/whisper-clip-linux

 
 

Repository files navigation

WhisperClip: One-Click Audio Transcription

Example using WhisperClip

WhisperClip simplifies your life by automatically transcribing audio recordings and saving the text directly to your clipboard. With just a click of a button, you can effortlessly convert spoken words into written text, ready to be pasted wherever you need it. This application harnesses the power of OpenAI's Whisper for free, making transcription more accessible and convenient. This is a fork from the original repo that:

  • Adds support to Linux as an operating system (tested on Fedora 41).
  • Refactors the transcription engine from Whisper to use Faster-Whisper, which uses CTranslate2 to speed up CPU inference.

Table of Contents

Features

  • Record audio with a simple click.
  • Automatically transcribe audio using Whisper (free).
  • Option to save transcriptions directly to the clipboard.

Installation

Installing using the .deb package

You can install the application on Debian-based systems using:

  • sudo apt install <PACKAGE_NAME.deb>
  • The .deb file can be found under releasesin the repository.
  • The .deb file was generated using AI assistance.

Prerequisites (if running natively using Python)

  • Python 3.8 or higher
  • CUDA is highly recommended for better performance but not necessary. WhisperClip can also run on a CPU. This was patched to use a lightweight implementation of Whisper that use a CPU as the main device.

Setting Up the Environment

  1. Clone the repository:

    git clone https://github.com/Pedrohgv/whisper-clip-linux.git
    cd whisper-clip
    
  2. Install the required dependencies:

    pip install -r requirements.txt
    

Choosing the Right Model

For English-only applications, .en models (e.g., tiny.en, base.en) tend to perform better.

To change the model, modify the model_name variable in config.json to the desired model name.

Usage

Run the application:

python main.py
  • Click the microphone button to start and stop recording.
  • If "Save to Clipboard" is checked, the transcription will be copied to your clipboard automatically.

Configuration

  • The default shortcut for toggling recording is Alt+Shift+R. You can modify this in the config.json file.
  • You can also change the Whisper model used for transcription in the config.json file.

Feedback

If there's interest in a more user-friendly, executable version of WhisperClip, I'd be happy to consider creating one. Your feedback and suggestions are welcome! Just let me know through the GitHub issues.

Acknowledgments

This project uses OpenAI's Whisper for audio transcription. This project is a fork from the original whisper-clip.

About

WhisperClip simplifies your life by automatically transcribing audio recordings and saving the text directly to your clipboard. With just a click of a button, you can effortlessly convert spoken words into written text, ready to be pasted wherever you need it. This application harnesses the power of OpenAI’s Whisper for free.

Resources

License

Code of conduct

Stars

Watchers

Forks

Packages

 
 
 

Contributors

Languages

  • Python 100.0%