Inspiration

Replae started with a simple moment we’ve all felt:

You open an old notebook — messy lecture notes, a forgotten goal, a journal entry from when life felt different. You read it… but you don’t really hear it. Your thoughts stay trapped on the page: scattered, overwhelming, easy to forget.

We asked:

“What if your mind could speak back to you?”

That question became the soul of Replae.

We wanted a tool that could take handwritten thoughts — the most personal, human kind of thinking — and bring them to life again. Not by rewriting them, but by giving them a voice: clear, structured, and emotionally resonant.

Replae is for students drowning in notes, for presenters who lose their voice, for people with dyslexia or low vision, and for anyone who just wants their ideas to finally make sense out loud.

What it does

Replae transforms handwritten thoughts into clear, organized audio you can listen to — like giving your mind a voice.

With a single photo, Replae can:

  1. Extract handwriting

You upload an image of:

lecture notes

journal pages

to-do lists

childhood writing

planning pages

notes from someone you love

Replae reads it using advanced vision OCR.

  1. Turn it into clean, usable text

Replae processes the handwriting to:

preserve punctuation

fix spacing

clean noise

remove distortions

Your messy handwriting becomes readable digital text.

  1. Enhance your thoughts (Enhanced Mode)

Using Claude, Replae can instantly transform your handwritten text into:

structured study notes

clear summaries

prioritized to-do lists

presentation scripts

outlines

action steps

clarity-optimized rewrites

You decide what you want.

  1. Preserve the original meaning (Raw Mode)

If you want no AI rewriting:

Replae reads the text exactly as written

perfect for journals, memories, and notes from someone important

It keeps the emotion and authenticity intact.

  1. Convert everything to expressive audio

Replae plays the text out loud in:

A clean synthetic voice

Or a personal uploaded voice

Use cases:

studying while walking

reliving memories

practicing presentations

accessibility for dyslexia or low vision

speaking when your voice is gone

  1. Save and revisit your entries

Each upload becomes an entry you can:

replay

reorganize

talk with in Enhanced Mode

turn into additional audio

keep as a diary or script

In short:

Replae takes a picture of your handwriting and turns it into:

organized text

structured insights

clear summaries

spoken audio

and emotional memories brought back to life

It gives your thoughts clarity. It gives your ideas structure. It gives your memories a voice.

How we built it

Backend

Flask (Python) — handles image uploads, OCR requests, AI routing, audio generation, and multi-mode logic.

Frontend

React + TypeScript + Vite — fast, modular UI with real-time state updates and smooth transitions.

OCR (Optical Character Recognition)

OpenAI GPT-4.1 Vision API — accurately extracts text from handwritten notes, journals, photos, and pages.

AI Assistant

Anthropic Claude API — powers Enhanced Mode with:

summaries

outlines

presentation scripts

study notes

action items

clarity rewrites

Text-to-Speech

Fish Audio API — generates expressive, natural-sounding audio from extracted and enhanced text.

Styling

Tailwind CSS — custom gradients, glassmorphism effects, animations, responsive layout.

Icons

Lucide React — consistent and modern UI icon system.

Challenges we ran into

Integrating Multiple API Keys We worked with several external services (OpenAI Vision, Anthropic Claude, and Fish Audio), each with different authentication flows, rate limits, and response structures. Coordinating these systems and managing environment variables across local and deployed environments required careful debugging.

Merge Conflicts During Rapid Development Because we were building the frontend, backend, OCR pipeline, and AI logic in parallel, we frequently encountered merge conflicts. Rapid iteration meant we had to repeatedly resolve code overlaps, restructure files, and keep our branches aligned without breaking the build.

AI Hallucinations and Unreliable Output Enhanced Mode needed consistent summaries, outlines, and scripts. Early versions produced hallucinations, formatting drift, or missing structure. We had to design guardrails, structured prompting, and post-processing to maintain reliability.

OCR Accuracy With Handwriting Handwritten input varied widely. Achieving clear extraction required experimentation with rotation correction, contrast enhancement, thresholding, sharpening, and punctuation restoration. Even then, cleaning and normalizing the text became a significant challenge.

Accomplishments that we're proud of

Built a fully working pipeline from handwritten image → OCR → AI enhancement → audio output

Integrated OpenAI Vision, Anthropic Claude, and Fish Audio into a unified workflow

Created high-accuracy preprocessing for handwritten OCR (rotation fix, denoise, thresholding, contrast enhancement)

Designed a clean, modern, mobile-responsive UI with multiple modes

Implemented a reliable multi-mode AI assistant (Raw, Enhanced, Study, Presentation, Clarity)

Achieved stable audio replay using user voice samples

Delivered an emotional, impactful user experience that feels personal and useful

Finished the entire application end-to-end within the hackathon time

What we learned

How to preprocess images for better OCR accuracy (contrast boosting, rotation correction, thresholding, noise reduction)

How to integrate multiple AI services (OpenAI Vision, Anthropic Claude, Fish Audio) into a single pipeline

How to design structurable prompts that produce consistent summaries, scripts, and outlines

How to handle API rate limits, retries, and error states gracefully

How to build a responsive, mobile-ready React frontend with real-time text + audio features

How to collaborate quickly while resolving merge conflicts and syncing large changes

How to design a product around emotional impact, not just technical capability

What's next for Replae

• Audio and video uploads

Allowing users to upload voice notes, lectures, and video clips so Replae can extract text, organize it, and convert it into high-quality audio summaries or scripts.

• Full mobile app version

A native iOS/Android experience so users can capture notes, images, and audio directly from their phone and replay insights instantly.

• More voice options — synthetic and fictional

Support for a wider variety of AI voices, including synthetic professional tones, character-style voices, and more personal custom voices.

• Podcast mode

A feature that turns a series of uploaded notes into a structured audio diary or personalized “daily podcast,” organizing your thoughts into episodes.

• Better audio/voice storage system

More efficient, scalable, and secure storage of user voice uploads and generated audio, with better caching, organization, and retrieval.

Built With

  • ai
  • anthropic
  • api
  • assistant:
  • audio
  • backend:
  • claude
  • css
  • fish
  • flask
  • frontend:
  • gpt-4.1
  • icons:
  • lucide
  • ocr:
  • openai
  • python)
  • react
  • styling:
  • tailwind
  • text-to-speech:
  • typescript
  • vision
  • vite
Share this project:

Updates