Recipe Receptionist

For BC Hacks 6

Recipe Receptionist

Inspiration

We created Recipe Receptionist to solve the frustration of finding specific moments in long videos, like key lecture examples or unique cooking tips. We also wanted to make video content more accessible for people with disabilities, such as those with visual or hearing impairments, who often struggle to access or interpret videos.

What it does

The app lets users upload a video or paste a URL. It analyzes the video frame by frame, transcribes the audio, and lets users ask specific questions about the content. For example:

Students can ask, "What was the example problem solved at 30 minutes?"

Cooking enthusiasts can ask, "What was the chef’s secret tip?"

For accessibility

Visually impaired users: Get audio descriptions of visual content.

Hearing-impaired users: Receive accurate transcriptions and visual summaries.

Cognitive disabilities: Simplifies complex video content into easy-to-understand answers.

How we built it

Frontend: Streamlit for a simple interface.

Backend: Python for video and audio processing.

APIs: OpenAI’s Whisper for transcription and GPT-4 for answering questions.

Video Processing: OpenCV for frames and FFmpeg for audio extraction.

URL Handling: you-get to download videos from URLs.

Challenges we ran into

Slow video processing for long videos.

API rate limits from OpenAI.

Ensuring accessibility for disabled users.

Handling large video files and different formats.

Accomplishments that we're proud of

Integrating multiple technologies into a seamless app.

Making video content more accessible and inclusive.

Building a functional prototype quickly.

Positive feedback from the accessibility community.

There are many image detection/analysing apps but not many video ones, so this is in a league of it's own

What we learned

Balancing performance and accuracy in video processing.

How AI can make content more inclusive.

Handling large datasets and optimizing efficiency.

The importance of user feedback for improvement.

What's next for Recipe Receptionist Better Accessibility: Add sign language interpretation, text-to-speech, and real-time captions.

Faster Performance: Optimize processing for longer videos.

Smarter AI: Add summarization, keyword search, and emotion analysis.

User Customization: Let users adjust settings like frame rate and language.

Platform Integration: Work with YouTube, Vimeo, and Coursera.

Community Collaboration: Open-source the project and partner with accessibility organizations.

Setup

Requirements:

mmfpeg installed on device
all python modules in requirements.txt. You can run pip3 install -r requirements.txt in the root directory to download them all.
you MUST have the following evironment variable: OPENAI_API_KEY=<YOUR_API_KEY>. Set this in your respective shell.

To run the app: streamlit run app.py

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
docs		docs
.gitignore		.gitignore
README.md		README.md
app.py		app.py
backend.py		backend.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Table of Contents

Recipe Receptionist

Inspiration

What it does

For accessibility

How we built it

Challenges we ran into

Accomplishments that we're proud of

What we learned

Setup

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Table of Contents

Recipe Receptionist

Inspiration

What it does

For accessibility

How we built it

Challenges we ran into

Accomplishments that we're proud of

What we learned

Setup

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages