Our Home Page
Pricing Page
Documentation / Install Page
Analytics Page
VsCode Extension
Bfs Code from talking

Speech Extension - VS Code Extension

Inspiration

As developers, we spend countless hours typing, debugging, and refactoring code. However, traditional coding methods can be limiting, especially for those with accessibility needs or those who want to enhance their workflow efficiency. Speech Extension was born from the idea of making coding more accessible, efficient, and hands-free. By integrating voice recognition into VS Code, developers can now speak their code, modify it, and even have it read back to them.

What it does

Speech Extension is a powerful VS Code extension that enables developers to code using voice commands. This tool enhances accessibility and productivity by integrating seamless speech-to-text and voice-driven code modifications.

Key Features:

Speak Code – Convert spoken words into code using speech-to-text technology.
Modify Code via Voice – Edit, delete, and manipulate code using natural language commands.
Text-to-Speech – Listen to your code being read aloud for better accessibility and review.
Hands-Free Coding – Reduce reliance on keyboard input and improve workflow efficiency.
Multi-Language Support (Coming Soon) – Expanding support to various programming languages.

How we built it

We leveraged the following technologies to bring Speech Extension to life:

VS Code API – To build the core extension functionalities.
OpenAI API – For speech recognition, transcription, and processing commands.
Faster Whisperer - Convert audio to text
Fast API - Send requests from extension to back-end server
Python – Backend processing and handling commands.
Node.js – Manages extension-related services.
Firebase – To track users and manage web services.
Tailwind CSS – For an elegant and user-friendly front-end experience.

Challenges we ran into

One of the biggest challenges we faced was optimizing the parsing speed to ensure real-time speech-to-code conversion. Efficiently handling voice input and transforming it into accurate, structured code required refining our parsing logic and improving response times.

Accomplishments that we're proud of

We are proud that our project is fully functional, allowing developers to speak, modify, and interact with their code using only their voice. Achieving seamless speech integration within VS Code was a significant milestone.

What we learned

Throughout the development process, we gained valuable insights into working with speech recognition APIs, optimizing real-time processing, and enhancing user experience within the VS Code environment. Additionally, we learned how to balance accuracy and speed when converting spoken commands into structured code.

What's next for Speech Extension

Our next steps include:

Publishing the extension to the VS Code Marketplace for wider accessibility.
Expanding language support beyond JavaScript, Python, and C++.
Adding customizable voice commands for a more personalized experience.
Enhancing AI-powered code assistance to make voice-based coding smarter and more intuitive.

Installation & Usage

Install from VS Code Marketplace

(Currently in progress – link will be added soon.)

Install Manually

Clone the repository: bash git clone https://github.com/tasnimhny/speech-app.git cd speech-app
Install dependencies: bash npm install
Build the extension: bash npm run build
Install backend dependencies: bash pip install -r requirements.txt
Launch in VS Code:
- Open the project in VS Code.
- Press F5 to run the extension in a new VS Code window.

How to Use

Activate Speech Mode:
- Open the Command Palette (Ctrl + Shift + P) and select Start Speech Mode.
- Alternatively, use a custom keyboard shortcut or a voice command (if configured).
Speak Your Code:
- Example: "Define a function called add that takes two arguments" → js function add(a, b) { return a + b; }
Modify Code via Voice:
- Example commands:
  - "Delete line 10" → Removes line 10.
  - "Change let to const on line 5" → Updates let to const on line 5.
  - "Read this function aloud" → Uses text-to-speech to read the function.
Stop Speech Recognition:
- Say "Stop listening" or use the Stop Speech Mode command.