Visulo - AI-Powered Study Coach

Visulo is a voice-first AI tutor that sees what you're studying. Capture screenshots and voice notes, then ask questions about your content using local or cloud AI models.

Features

📸 Screenshot Capture: Capture any screen content with OCR text extraction
🎤 Voice Notes: Push-to-talk voice recording with transcription
🧠 AI-Powered Answers: Get grounded answers from your captured content
🔍 Semantic Search: Find relevant information using vector embeddings
⚙️ Flexible Providers: Choose between local and cloud AI providers

AI Provider Options

Embedding Providers

Local Python (Default): Uses sentence-transformers locally (privacy-focused)
OpenAI: Cloud-based embeddings (requires API key)
Stub: Development/testing mode

LLM Providers

Ollama (Default): Local LLM via Ollama (privacy-focused)
OpenAI GPT-3.5: Cloud-based LLM (requires API key)
Stub: Development/testing mode

Quick Start

Prerequisites

Node.js (v16 or higher)
Rust (latest stable)
Python 3.7+ (for local embeddings)

Installation

Clone the repository
```
git clone <repository-url>
cd visulo
```
Install dependencies
```
npm install
```
Set up local embeddings (optional but recommended)
```
python setup_local_embeddings.py
```

Set up Ollama (optional but recommended)

```
# Install Ollama from https://ollama.ai
# Then pull a model:
ollama pull llama3
```

Run the development server
```
npm run tauri:dev
```

Configuration

Using Local Providers (Privacy-Focused)

Local Embeddings:
- Run python setup_local_embeddings.py to install sentence-transformers
- In Settings → Embedding Provider → Select "Local Python"
Local LLM:
- Install Ollama
- Pull a model: ollama pull llama3
- In Settings → LLM Provider → Select "Ollama"

Using OpenAI (Cloud-Based)

Get an OpenAI API key from the OpenAI Platform
In Settings, enter your API key
Select "OpenAI" for both the Embedding and LLM providers

Usage

Basic Workflow

Capture Content:
- Click the camera button or press F9 for screenshots (works globally)
- Hold the microphone button or press Ctrl+Space for voice notes
AI Processing:
- OCR extracts text from screenshots
- Content is automatically indexed with embeddings
- AI generates contextual answers
Ask Questions:
- Your captured content becomes searchable
- AI provides grounded answers with citations
- View source snippets for each answer
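
To illustrate what "grounded answers with citations" can mean in practice, here is a minimal Python sketch of how retrieved snippets might be packed into a prompt with numbered sources. The function name `build_grounded_prompt` and the prompt wording are assumptions made for illustration; Visulo's actual prompt lives in the Rust backend and may differ.

```python
def build_grounded_prompt(question, snippets):
    """Build a prompt that asks the model to answer only from numbered snippets."""
    if not snippets:
        raise ValueError("at least one snippet is required for a grounded answer")
    # Number each snippet so the model can cite it as [1], [2], ...
    sources = "\n".join(f"[{i}] {text}" for i, text in enumerate(snippets, start=1))
    return (
        "Answer the question using only the sources below. "
        "Cite sources by number, for example [1]. "
        "If the sources do not contain the answer, say so.\n\n"
        f"Sources:\n{sources}\n\n"
        f"Question: {question}\n"
        "Answer:"
    )


# Hypothetical snippets standing in for captured screenshot text.
prompt = build_grounded_prompt(
    "Where is ATP produced?",
    ["Mitochondria produce ATP.", "Cellular respiration occurs in mitochondria."],
)
print(prompt)
```

Numbering the snippets is what lets the UI map a citation such as [2] back to the source snippet shown alongside the answer.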

Keyboard Shortcuts

F9: Quick screenshot (works globally, even when app is minimized)
Ctrl+Space: Push-to-talk (hold to record)

Architecture

Backend (Rust/Tauri)

Capture System: Screenshot capture, OCR processing
Indexing Service: Text chunking, embedding generation, SQLite storage
Retrieval Service: Semantic search, answer generation
Provider System: Modular AI providers (local/cloud)
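
As a rough picture of the text chunking step in the Indexing Service, the sketch below splits text into overlapping word windows. It is written in Python for brevity, while the real service is Rust; the function name `chunk_text`, the word-based splitting, and the window sizes are all assumptions, not Visulo's actual parameters.

```python
def chunk_text(text, max_words=50, overlap=10):
    """Split text into chunks of up to max_words words, sharing `overlap` words."""
    if max_words <= 0:
        raise ValueError("max_words must be positive")
    if not 0 <= overlap < max_words:
        raise ValueError("overlap must be in the range [0, max_words)")
    words = text.split()
    step = max_words - overlap  # how far the window advances each time
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + max_words]))
        # Stop once the current window has reached the end of the text.
        if start + max_words >= len(words):
            break
    return chunks


sample = " ".join(f"w{i}" for i in range(12))
for chunk in chunk_text(sample, max_words=5, overlap=2):
    print(chunk)
```

The overlap means a sentence that straddles a chunk boundary still appears whole in at least one chunk, which tends to help retrieval.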

Frontend (React/TypeScript)

Capture Interface: Screenshot and voice capture UI
History Sidebar: Real-time processing indicators
Settings Panel: Provider configuration
Toast System: User feedback and notifications

Data Flow

Screenshot/Voice → OCR/Transcription → Text Chunks → Embeddings → SQLite
                                                                      ↓
Query → Query Embedding → Similarity Search → Top-K Chunks → LLM → Grounded Answer
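
The query half of this flow can be sketched in a few lines of Python. This is only an illustration under stated assumptions: cosine similarity as the scoring function, an in-memory list in place of SQLite, and toy 3-dimensional vectors in place of real embeddings. The names `cosine_similarity` and `top_k_chunks` are hypothetical and do not come from the Visulo codebase.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_k_chunks(query_embedding, indexed_chunks, k=3):
    """Return the k (score, text) pairs most similar to the query, best first."""
    scored = [
        (cosine_similarity(query_embedding, embedding), text)
        for text, embedding in indexed_chunks
    ]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:k]


# Toy embeddings standing in for real model output (assumption).
INDEX = [
    ("Mitochondria produce ATP.", [0.9, 0.1, 0.0]),
    ("The French Revolution began in 1789.", [0.0, 0.2, 0.9]),
    ("Cellular respiration occurs in mitochondria.", [0.8, 0.3, 0.1]),
]

results = top_k_chunks([1.0, 0.0, 0.0], INDEX, k=2)
for score, text in results:
    print(f"{score:.3f}  {text}")
```

The selected chunks are what would then be passed to the LLM as context for the grounded answer.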

Development

Project Structure

visulo/
├── src/                    # React frontend
│   ├── components/         # UI components
│   └── styles/            # Tailwind CSS
├── src-tauri/             # Rust backend
│   └── src/               # Tauri application logic
├── public/                # Static assets
└── setup_local_embeddings.py  # Setup script

Building

```
# Development
npm run tauri:dev

# Production build
npm run tauri:build
```

Adding New Providers

Embedding Provider:
- Add variant to EmbeddingProvider enum
- Implement in EmbeddingService::generate_embedding()
- Update frontend settings UI
LLM Provider:
- Add variant to LLMProvider enum
- Implement in LLMService::generate_grounded_answer()
- Update frontend settings UI

Troubleshooting

Local Embeddings Issues

Ensure Python 3.7+ is installed
Run python setup_local_embeddings.py to install dependencies
Check that sentence-transformers is properly installed

Ollama Issues

Ensure Ollama is running: ollama serve
Check available models: ollama list
Pull a model if needed: ollama pull llama3
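
The same checks can be scripted. The Python sketch below queries Ollama's HTTP API at its default address (`http://localhost:11434`) and reads the model list from the `/api/tags` endpoint. The helper names `fetch_tags`, `model_names`, and `has_model` are made up for this example, and it assumes Ollama is running on the default port.

```python
import json
import urllib.error
import urllib.request

OLLAMA_URL = "http://localhost:11434"  # Ollama's default address (assumed unchanged)


def model_names(tags_payload):
    """Extract model names from a parsed /api/tags response."""
    return [model["name"] for model in tags_payload.get("models", [])]


def has_model(tags_payload, wanted):
    """True if `wanted` is installed, matched with or without a tag like ':latest'."""
    for name in model_names(tags_payload):
        if name == wanted or name.split(":", 1)[0] == wanted:
            return True
    return False


def fetch_tags(base_url=OLLAMA_URL, timeout=3.0):
    """Return the parsed /api/tags response, or None if the server is unreachable."""
    try:
        with urllib.request.urlopen(f"{base_url}/api/tags", timeout=timeout) as resp:
            return json.load(resp)
    except (urllib.error.URLError, OSError, ValueError):
        return None


if __name__ == "__main__":
    tags = fetch_tags()
    if tags is None:
        print("Ollama is not reachable; try starting it with: ollama serve")
    elif not has_model(tags, "llama3"):
        print("llama3 is not installed; try: ollama pull llama3")
    else:
        print("Ollama is running and llama3 is available")
```

If the script reports that Ollama is unreachable even though `ollama serve` is running, the server may be bound to a non-default host or port.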

OpenAI Issues

Verify that the API key is correct and the account has available credits
Check network connectivity
Ensure the API key has appropriate permissions

General Issues

Check console logs for detailed error messages
Verify all dependencies are installed
Try restarting the application

Privacy & Security

Local Mode: All processing happens on your device when using Local Python + Ollama
API Keys: Stored locally and sent only to the service they belong to
Data Storage: All captures stored locally in SQLite database
No Telemetry: No usage data is collected or transmitted

Contributing

Fork the repository
Create a feature branch
Make your changes
Add tests if applicable
Submit a pull request

License

[Add your license here]

Support

For issues and questions:

Check the troubleshooting section above
Review console logs for error details
Open an issue on GitHub with detailed information

Note: This is an early version. Some features may be experimental or require additional setup.
