Inspiration

One afternoon at the library, we met a young boy struggling to read because he was visually impaired and had forgotten his glasses. Seeing how something so small could create such a big barrier inspired us to build Zardon Vision — an AI-powered mobile assistant designed to increase independence and accessibility for visually impaired individuals.


What it does

Zardon Vision transforms a smartphone into a real-time visual assistant.

  • 📷 Object & human detection using computer vision
  • 📖 OCR with text-to-speech to read books, signs, and documents aloud
  • 🧭 Depth awareness to better understand surroundings
  • 🗣️ Siri shortcut integration for hands-free requests and emergency contact access

The app converts visual information into audio, allowing users to better navigate and understand their environment.
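As a rough illustration of that visual-to-audio step, here is a minimal Python sketch of how detection results might be turned into a spoken sentence. The `describe_scene` function and its confidence floor are hypothetical, not the app's actual code (on-device, the announcement is spoken via Apple's speech APIs):

```python
def describe_scene(detections):
    """Turn raw (label, confidence) detection pairs into a short spoken
    announcement, skipping low-confidence results.

    `detections` is a hypothetical model output format used for this sketch.
    """
    CONFIDENCE_FLOOR = 0.5  # assumed cutoff; tune per model
    labels = [label for label, conf in detections if conf >= CONFIDENCE_FLOOR]
    if not labels:
        return "Nothing recognized nearby."
    if len(labels) == 1:
        return f"I see a {labels[0]}."
    return "I see " + ", ".join(f"a {l}" for l in labels[:-1]) + f", and a {labels[-1]}."
```

The resulting string would then be handed to the phone's text-to-speech engine.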


How we built it

Frontend: SwiftUI + UIKit
Backend: Python + Flask
AI/ML: PyTorch, YOLOv2, Create ML
Text Recognition: Apple VisionKit & AVFoundation
Website: HTML + CSS

Images are captured on-device and sent to our Flask server, where custom-trained PyTorch models classify objects and return results to be read aloud. For text, Apple’s Vision APIs detect and extract written content, which is converted into speech instantly.
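A minimal sketch of what that Flask endpoint could look like. The route name and the `classify_image` helper are assumptions for illustration; in the real system the helper would wrap the custom-trained PyTorch model rather than return a fixed result:

```python
from flask import Flask, jsonify, request

app = Flask(__name__)

def classify_image(image_bytes):
    # Placeholder for the custom-trained PyTorch model; returns a fixed
    # result here so the sketch stays self-contained and runnable.
    return "book", 0.87

@app.route("/classify", methods=["POST"])
def classify():
    # The phone POSTs the captured image as raw bytes.
    image_bytes = request.get_data()
    label, confidence = classify_image(image_bytes)
    # The app reads the returned label aloud on receipt.
    return jsonify({"label": label, "confidence": confidence})
```

Keeping the response a small JSON payload keeps round-trip latency low, which matters when the result is spoken in real time.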


Challenges we ran into

  • Overfitting and low accuracy in facial/object recognition
  • No fallback (“catch case”) for uncertain classifications
  • Flask server connection issues on restricted WiFi
  • SwiftUI limitations that required UIKit workarounds
  • Designing a UI truly accessible for visually impaired users

Accomplishments that we're proud of

  • Successfully training and deploying custom YOLOv2 models
  • Integrating real-time OCR with audio output
  • Building Siri shortcut voice activation that responds in under 10 seconds
  • Creating a clean, high-contrast, accessibility-first interface
  • Delivering a fully functional end-to-end AI-powered system

What we learned

  • Model training requires diverse datasets to avoid overfitting
  • Accessibility design is more than aesthetics — it’s usability and clarity
  • Backend reliability is just as important as model performance
  • Building AI for real-world users demands iteration and testing

What's next for Zardon Vision

  • Improve model accuracy with larger datasets
  • Add fallback classifications for uncertain predictions
  • Enable offline inference
  • Expand multilingual OCR support
  • Enhance depth-based navigation features
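The planned fallback for uncertain predictions could work roughly like this sketch: compute the model's class probabilities and refuse to announce a label below a confidence threshold. The function names and the 0.6 threshold are illustrative assumptions, not the shipped logic:

```python
import math

def softmax(logits):
    """Convert raw model logits to probabilities (numerically stable)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def classify_with_fallback(logits, labels, threshold=0.6):
    """Return the top label, or "unknown" when the model is not confident.

    This is the "catch case" idea: rather than announcing a shaky guess,
    the app would tell the user it isn't sure.
    """
    probs = softmax(logits)
    best = max(range(len(probs)), key=probs.__getitem__)
    if probs[best] < threshold:
        return "unknown", probs[best]
    return labels[best], probs[best]
```

A confident prediction passes through unchanged, while near-uniform logits fall back to "unknown".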

Zardon Vision is just the beginning of our mission to make AI-driven accessibility tools more powerful, reliable, and widely available.
