Smart Voice Summarizer

Clean it. Hear it. Summarize it. Know more, listen less.

Comment

Your one stop shop for summaries

Inspiration

As a student, it is very time consuming to sit and watch all recorded lectures. We wanted to aim at something that is simple, time efficient and accurate. Then we realized that this is an issue that people from all sectors face. This is what motivated us to come up with our problem statement.

What it does

It takes raw audio input, removes noise and transcribes it. It can then summarize it and convert the summary text to speech using Neuphonic's API.

How we built it

Python
Google Colab / VS Code
Whisper for transcription
Neuphonic for voice synthesis
noisereduce / DeepFilterNet for noise cleaning
Streamlit for frontend

Challenges we ran into

Cleaning audio files
Lack of free APIs
Inaccurate transcription

Accomplishments that we're proud of

Successfully integrating Neuphonic's API.
Finding alternatives to training models in a restricted timeframe

What we learned

We learned how to integrate different APIs in web applications and use tech stacks that we were not familiar with like Streamlit.

What's next for Smart Voice Summarizer

Due to constraints on time and resources, these are what we would like to implement later on:

More efficient noise removal/reduction
Quicker and more precise transcription
Customizable summary length ( audio and text)

Built With

apis
machine-learning
python
streamlit

Updates

sriharan11 Rajkumar started this project — Mar 23, 2025 06:49 AM EDT

Leave feedback in the comments!

Log in or sign up for Devpost to join the conversation.