An AI-powered reading assistant for visually impaired children with voice conversations, OCR-based book reading, and educational Q&A.
# 1. Install dependencies
npm install
# 2. Run automated setup
npm run setup
# 3. Add your OpenAI API key to .env file
# Get key from: https://platform.openai.com/api-keys
# 4. Start the system
npm run server- Speech-to-Text: Click the microphone button to speak naturally
- Text-to-Speech: AI responses are automatically spoken aloud
- Real-time Conversation: Interactive chat interface with AI
- LiveKit Integration: Real-time audio streaming and processing
- OpenAI GPT-3.5-turbo: Intelligent AI responses
- Customizable Settings: Adjust voice, speech rate, and pitch
- Responsive Design: Works on desktop and mobile devices
- Real-time Audio Processing: Advanced audio analysis and streaming
- Conversation Memory: Maintains context across messages
# Clone or download the project
# Navigate to project directory
# Install dependencies
npm install
# Run automated setup
npm run setup
# Edit .env file and add your OpenAI API key
# Start the system
npm run server# Install dependencies
npm install
# Create environment file
cp env.example .env
# Edit .env and add your OpenAI API key
# Get key from: https://platform.openai.com/api-keys
# Start the system
npm run server- Visit: https://platform.openai.com/api-keys
- Sign in with your OpenAI account
- Click "Create new secret key"
- Copy your API key (starts with
sk-)
# Edit .env file
OPENAI_API_KEY=sk-your-actual-api-key-here
LIVEKIT_URL=ws://localhost:7880
LIVEKIT_API_KEY=devkey
LIVEKIT_API_SECRET=secret
PORT=3000- HTML5, CSS3, JavaScript (ES6+)
- Web Speech API (voice recognition + synthesis)
- Tesseract.js (OCR)
- TensorFlow.js (ML models)
- Node.js + Express.js
- OpenAI API (conversational AI)
- Google OAuth 2.0
- LiveKit (real-time audio processing)
User Speech → Web Speech API → OpenAI GPT → Text-to-Speech → Audio Output
↑ ↓
└─────────── Conversation Loop ──────┘
npm run dev:full # Start both servers- Set up production environment variables
- Configure LiveKit Cloud or your own LiveKit server
- Deploy to your preferred hosting platform
- Set up SSL/HTTPS for secure connections
API Key Not Working
- Verify key format (starts with
sk-) - Check billing status on OpenAI account
- Ensure
.envfile exists and is properly formatted
Voice Not Working
- Check browser microphone permissions
- Try typing instead of speaking
- Check browser console for errors
Server Won't Start
- Run
npm installto install dependencies - Check
.envfile exists - Verify port 3000 is available
LiveKit Connection Issues
- LiveKit is optional for basic AI functionality
- Check LiveKit server is running
- Verify network connectivity
This project is open source and available under the MIT License.
Feel free to submit issues, feature requests, or pull requests to improve this application.
- OpenAI for GPT-3.5-turbo API
- LiveKit for real-time audio streaming
- Web Speech API for browser-based speech recognition
- Express.js for the web server framework