Inspiration
The inspiration for this project was behind having to watch long and boring lectures. This challenge inspired us to create NoteConvert, which is an app that automatically transforms videos and images into comprehensive notes, making learning and information retention faster and more efficient.
What it does
Our app essentially takes in any form of media, and converts it into a detailed and comprehensive set of notes. This is very useful for people who don't have time to or don't want to watch a long video or read a large document. Instead, all they have to do is put the file/image/video into NoteConvert, and it will convert it into a set of notes.
How we built it
We built NoteConvert using a combination of front-end and back-end technologies:
The frontend was developed using React and CSS for a responsive, user-friendly interface. We implemented user authentication and built dynamic components that allow users to upload files and view results seamlessly.
- The backend was powered by Python and Flask. We used image and video processing libraries to handle the uploaded files. For text extraction from images, we used OCR (Optical Character Recognition) tools.
- The extracted text was sent to the OpenAI API for summarization. For videos, we employed speech recognition libraries to transcribe the audio before sending it for summarization.
Challenges we ran into
Building NoteConvert came with its fair share of challenges:
- Integration Issues: Integrating various APIs and ensuring smooth communication between the frontend and backend was complex. We had to debug multiple issues with API requests and data handling.
- Optimizing Performance: Processing large videos and images required significant computing resources, and we needed to optimize the code to reduce latency and improve user experience.
- User Experience: Designing an intuitive user interface that is both visually appealing and easy to navigate was a major focus. We iterated multiple times on the design to ensure that it met our high standards for usability.
What we learned
During this project, we learned a great deal about the power of AI and machine learning in text processing and summarization. We explored image recognition techniques and APIs like OpenAI's, which helped us understand how to effectively convert various media formats into structured, readable notes. We also gained hands-on experience with web development tools like React and CSS, creating a sleek, modern user interface that enhances user experience.
What's next for NoteConvert
There are many next steps we can take with NoteConvert. For one, we can even further enhance and develop the UI, to allow for a better User Experience. Also, we could further enhance the features we include, such as a chatbot for the user to communicate with and etc.
Built With
- css
- html
- javascript
- openai-api
- python
- react

Log in or sign up for Devpost to join the conversation.