VisionMouse is a hardware + software system designed to help visually impaired users explore complex visual data such as charts, maps, dashboards, and technical drawings.
The system replaces slow, linear screen reader navigation with spatial exploration using tactile and adaptive audio feedback.
Computer side:

- Desktop UI (PySide6)
- Screenshot capture
- Multi-agent Gemini image analysis
- Grid-based intensity + relevance mapping
- ElevenLabs text-to-speech and sound effects
- WebSocket communication with Raspberry Pi
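This link carries motion updates from the Pi to the computer and feedback commands back. A minimal sketch of the desktop end, assuming the `websockets` package and a hypothetical JSON message format (neither is confirmed by the actual protocol):

```python
# Sketch of the desktop end of the WebSocket link.
# The JSON message format here is an assumption, not the actual protocol.
import asyncio
import json

import websockets  # pip install websockets


async def handle_device(ws):
    # Handler signature for websockets >= 10.1 (single connection argument).
    async for message in ws:
        motion = json.loads(message)          # e.g. {"dx": 3, "dy": -1}
        intensity = lookup_intensity(motion)  # hypothetical grid lookup
        await ws.send(json.dumps({"volume": intensity}))


def lookup_intensity(motion):
    return 0.5  # placeholder; the real system resolves a grid cell


async def main():
    async with websockets.serve(handle_device, "0.0.0.0", 8765):
        await asyncio.Future()  # serve until cancelled


if __name__ == "__main__":
    asyncio.run(main())
```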
Raspberry Pi side:

- Optical motion / trackball input
- Servo motor tactile feedback
- Vibration motor support
- Speaker output
- Real-time feedback loop
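On the device, feedback commands end up as PWM signals on the actuators. A minimal sketch of that mapping, assuming `gpiozero` and hypothetical pin assignments:

```python
# Sketch of the actuator side; pin numbers (BCM) are assumptions.
# On a Raspberry Pi 5, gpiozero may need the lgpio pin factory
# (pip install lgpio).
from gpiozero import PWMOutputDevice, Servo

servo = Servo(17)                # tactile-feedback servo (assumed pin)
vibration = PWMOutputDevice(18)  # vibration motor driver (assumed pin)


def apply_feedback(relevance: float, intensity: float) -> None:
    """Map analysis values in [0, 1] onto the actuators."""
    # Low relevance pushes the servo toward its "blocking" position.
    servo.value = 1.0 - 2.0 * relevance  # [0, 1] -> [1, -1]
    vibration.value = max(0.0, min(1.0, intensity))
```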
Typical workflow (the first two steps are sketched in code after the list):

- The user captures a screenshot of visual content.
- The system analyzes the image using a multi-agent pipeline powered by Gemini.
- The user selects a factor to explore (e.g., intensity, density, borders).
- Moving the device produces:
  - Continuous audio intensity changes
  - Tactile feedback for non-relevant areas
  - On-demand AI-generated descriptions
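A condensed sketch of the first two steps (capture, then analysis), assuming the `google-generativeai` package; the model name and prompt are illustrative, not the project's actual multi-agent pipeline:

```python
# Sketch of capture + single-shot analysis (illustrative, not the
# project's actual multi-agent pipeline).
import os

import google.generativeai as genai
from PIL import Image
from PySide6.QtWidgets import QApplication

app = QApplication([])
QApplication.primaryScreen().grabWindow(0).save("capture.png", "PNG")

genai.configure(api_key=os.environ["GOOGLE_API_KEY"])
model = genai.GenerativeModel("gemini-1.5-flash")  # illustrative model name
response = model.generate_content(
    ["Describe the chart in this screenshot.", Image.open("capture.png")]
)
print(response.text)
```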
The grid-based approach allows fast spatial interaction without repeatedly sending cropped images to the model: the screenshot is analyzed once into a grid of values, and subsequent cursor positions are resolved against that grid locally.
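A minimal sketch of that idea, with illustrative names and grid size:

```python
# Sketch of local grid lookup (names and grid size are illustrative).
GRID = 16  # analysis resolution: GRID x GRID cells per screenshot

# Filled once per screenshot by the analysis pipeline, e.g. an
# intensity value in [0, 1] for each cell.
intensity_map = [[0.0] * GRID for _ in range(GRID)]


def cell_value(x: int, y: int, width: int, height: int) -> float:
    """Resolve a cursor position in pixels to its cell's value."""
    col = min(GRID - 1, x * GRID // width)
    row = min(GRID - 1, y * GRID // height)
    return intensity_map[row][col]
```

Each movement then costs a constant-time table lookup instead of a model call.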
Tech stack:

- Python
- Raspberry Pi 5
- PySide6
- Google Gemini API
- ElevenLabs API
- WebSockets
- GPIO / PWM control
Computer setup:

```bash
cd Computer
pip install -r requirements.txt
python app.py
```
Create a .env file with:
```
GOOGLE_API_KEY=your_key
ELEVENLABS_API_KEY=your_key
```
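These keys would typically be read at startup; a minimal sketch assuming the `python-dotenv` package:

```python
# Sketch of loading the keys, assuming python-dotenv is installed.
import os

from dotenv import load_dotenv

load_dotenv()  # reads .env from the working directory
google_key = os.environ["GOOGLE_API_KEY"]
elevenlabs_key = os.environ["ELEVENLABS_API_KEY"]
```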
Raspberry Pi setup:

```bash
cd RaspberryPi
pip install -r requirements.txt
python main.py
```
Ensure GPIO permissions, PWM, and audio output are configured properly.
VisionMouse is an experimental prototype developed for accessibility-focused exploration of visual data.