Make sure you have Twilio installed by running the command:
pip install twilio
pip install openai
Install pyaudio
brew install portaudio
pip install pyaudio
Install dotenv
pip install python-dotenv
Install google gemini api if you want gemini AI summarizer:
pip install google.generativeai
- For Mac Users install the Mac Blackhole Audio Driver at this [link] (https://github.com/ExistentialAudio/BlackHole) (Make sure to download the 2ch version)
- Open the app "Audio MIDI Setup"
- Create a new device by clicking the bottom left "+" sign
- Select "Macbook Pro Speakers" and "Blackhole 2ch"
- Make sure "Drift Connection" is enabled on Blackhole 2ch
- Now set your audio to the sound you want it to be enabled at
- Now go to Settings -> Sound Output -> Output
- Select the Multi-Output Device or whatever the audio connection was named on steps 4 - 5 (This should lock in your computer volume changing)
You can get your Twilio phone number by following these steps or by watching this YouTube Video.
- Go to the Twilio Website
- Create a new account by clicking the sign-up button in the top right corner
- Verify your phone number through the signup process
- Verify your email address
- Complete the survey questions according to your preferences
- On your account dashboard, click on "Get a Twilio phone number"
- Scroll down to find the Account Info tab
- Expand the tab to locate your:
- Account SID
- Auth Token
- My Twilio Phone Number
- Copy and paste these credentials into the website
- Sometimes the app may crash after some uses, in these cases you may need to reset the cookies on
- Certain old softwares with old audio drivers may not be compatible with this software such as zoom, but google meetings, discord and MS teams have proved to be consistent
---
title: Meetings Pager
---
graph LR
A(Computer Audio Input) --> b(Local Transcriber)
A --> B(OpenAI's Whisper Model)
B -- Transcribes text--> C(Stores local transcript)
C --> D(Keyword/Name Detection from transcript)
D --> E(Sentence Filtering if your name is called for a good reason)
E --> F(Returns Summary on topics of what you've talked about)
E --> G(Calls your Phone)
b --> c(Keyword/Name Detection from transcript)
c --> d(Calls your Phone)
├── main.py # Main execution of the functions toether
├── docs # Documentation files (alternatively `doc`)
├── Components # Components of the pager
│ ├── call_component.py # Number Calling Functionality
│ ├── gui_component.py # Python Gui Maker
│ ├── gemini_componenet.py# Gemini prompting AI
├── Images/ # Miscellaneous Images
├── LICENSE # License to prevent people from commercializing our product
├── .gitignore # Telling github to ignore your credentials from being uploaded
├── README.md # This file for documentation
└── ...
