Ambient AI scribe app development is hot because the pain is real: physicians spend roughly 34% to 55% of their time on clinical documentation, and that documentation burden is tightly linked to clinical burnout. Nuance DAX, Abridge, Suki AI, and Microsoft Azure AI Health Insights have turned ambient clinical documentation from a flashy demo into a real product category. Abridge alone raised $150 million in 2024, while Microsoft now packages ambient documentation directly into its healthcare AI stack.
The market is moving fast enough that EHR vendors can no longer shrug and pretend this is somebody else’s problem. Epic rolled out AI Charting in February 2026, and Oracle Health’s roadmap now explicitly includes ambient AI for documentation workflows.
A prototype is easy. A weekend and a few APIs will get you something that looks clever in a demo. Building one that is clinically accurate, HIPAA-compliant, EHR-integrated, and actually fast enough for a 15-minute visit is the hard part. That is the problem this guide maps.
What does it actually take to build an ambient AI scribe?
Building an ambient AI scribe takes much more than speech-to-text and a good LLM prompt. To work in real care settings, the product needs reliable audio capture, speaker diarization, structured entity extraction, safe note generation, HIPAA-ready handling of audio PHI, and direct EHR write-back through standards like FHIR and SMART on FHIR.
Key Takeaways:
- A demo is easy; a deployable product is hard. Ambient scribes only become useful when audio capture, ASR, diarization, note generation, compliance, and EHR integration all work together under real clinical conditions.
- The biggest technical shortcut is also the most dangerous one. Sending raw transcripts straight into an LLM is how teams get polished-looking notes that quietly invent or miss clinically important details.
- The best place to differentiate is not plumbing. Most teams should buy core infrastructure where possible and focus their custom effort on specialty-specific templates and the physician review workflow, where trust and product value are actually won.
Table of Contents
- What Is an Ambient AI Scribe?
- The Five-Stage Technical Pipeline — Where the Hard Problems Live
- Audio Capture and ASR — The Foundation Layer
- Speaker Diarization — The Underrated Hard Problem
- LLM Note Generation — Prompt Engineering for Clinical Accuracy
- HIPAA Compliance for Ambient AI Scribes — Audio PHI Is Different
- EHR Integration — Writing Notes Back Where Physicians Live
- The Build-vs-Buy Decision Matrix — What to Build, What to Use as API
- Specialty-Specific Requirements — Where Generalist Products Fail
- FDA Classification for Ambient AI Scribes
- The Ambient AI Scribe Build Checklist
- Why Choose Topflight for Ambient AI Scribe Development
What Is an Ambient AI Scribe?
An ambient AI scribe is a software system that listens to a clinical encounter in the background and turns that conversation into a structured draft note without requiring the clinician to dictate it line by line.
That is the key distinction from old-school dictation tools: the doctor is not narrating a note for the machine. The system is capturing a real conversation, extracting the clinically relevant parts, and shaping them into ambient clinical documentation AI that supports physician documentation rather than interrupting it.
In practical terms, this is one of the clearest examples of ambient clinical intelligence being applied to real care workflows. Technically, the workflow has five stages:
- Audio capture: recording the encounter through a room microphone, mobile device, or dedicated hardware
- Automatic speech recognition (ASR): converting speech into text with enough medical vocabulary accuracy to be useful
- Speaker diarization: separating who said what (physician, patient, family member, nurse, interpreter)
- Clinical NLP and entity extraction: identifying the usable clinical content such as symptoms, medications, findings, assessment, and plan
- Structured note generation: turning those extracted facts into a SOAP note or another clinical note template using an LLM
That is why ambient scribing sits inside the broader shift toward conversational AI in healthcare, but with much stricter requirements. A chatbot can be vague and still feel clever. A clinical note cannot. Each layer in this stack brings its own accuracy, latency, and compliance problems, which is where most products stop being impressive and start being hard.
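The five stages above can be sketched as a typed pipeline with clean seams between them. This is an illustrative skeleton, not a real SDK: every name here (`Segment`, `run_pipeline`, the stage callables) is hypothetical, and in production each callable would wrap a vendor API or model behind the same interface.

```python
from dataclasses import dataclass, field

@dataclass
class Segment:
    start: float          # seconds from encounter start
    end: float
    speaker: str          # e.g. "physician", "patient"
    text: str

@dataclass
class EncounterRecord:
    audio_chunks: list = field(default_factory=list)   # stage 1 output
    transcript: list = field(default_factory=list)     # stages 2-3: Segment objects
    entities: dict = field(default_factory=dict)       # stage 4: structured clinical facts
    draft_note: str = ""                               # stage 5: draft for physician review

def run_pipeline(audio_chunks, asr, diarize, extract, generate):
    """Each callable stands in for one stage; swap real vendors in behind the same seams."""
    rec = EncounterRecord(audio_chunks=audio_chunks)
    raw = asr(audio_chunks)                  # stage 2: timestamped transcript
    rec.transcript = diarize(raw)            # stage 3: speaker-labeled segments
    rec.entities = extract(rec.transcript)   # stage 4: the safety layer, not a nice-to-have
    rec.draft_note = generate(rec.entities)  # stage 5: draft only; clinician must review
    return rec
```

Keeping the stage boundaries explicit is what lets you benchmark, audit, and replace each layer independently instead of debugging one opaque blob.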
The Five-Stage Technical Pipeline — Where the Hard Problems Live
If you strip away the marketing gloss, ambient AI scribe development is a five-stage pipeline. Each stage has a distinct job, a different failure mode, and its own compliance exposure. That is why ambient scribe products are easy to demo but hard to make reliable in production.
| Stage | Input | Processing | Output | Target Latency | HIPAA Risk & Notes |
| --- | --- | --- | --- | --- | --- |
| 1. Audio Capture | Microphone or device audio stream | Noise filtering, compression, streaming buffer management | Audio stream or chunked segments sent to ASR | < 200 ms buffer | Critical — audio from a clinical encounter is PHI immediately. The capture endpoint must be HIPAA-eligible, and third-party analytics on the audio stream are asking for trouble. |
| 2. ASR / Transcription | Audio stream (PCM or compressed) | Medical-vocabulary transcription, real-time or near-real-time processing | Raw transcript with timestamps and confidence scores | < 2 seconds per segment | High — the transcript contains full encounter PHI, so any ASR vendor in the loop needs a BAA. |
| 3. Speaker Diarization | Transcript + audio stream | Speaker embedding, physician/patient classification | Speaker-labeled transcript | Real-time preferred; up to 5 seconds acceptable | Medium — diarization errors ripple downstream and poison everything that follows. |
| 4. Clinical NLP / Entity Extraction | Speaker-labeled transcript | Large language model (LLM) or fine-tuned NLP processing, medical entity tagging, intent classification | Structured clinical entities: symptoms, diagnoses, medication, findings, plan items | 10–30 seconds after encounter | High — this is dense PHI processing, and it is where medical entity extraction becomes the safety layer rather than a nice-to-have. |
| 5. Note Generation | Structured entities + encounter context | Template-driven structured note generation using GPT-4, Claude, or another model | Draft note in SOAP, APSO, H&P, or other format | 30–90 seconds after encounter | High — the generated note is a clinical document, and hallucination risk means physician review is mandatory before write-back. |
The hard parts are not evenly distributed. Audio capture is technically the simplest layer, but it creates PHI the moment recording starts. Speaker diarization is where many products quietly fall apart in real exam rooms: similar voices, overlapping speech, interpreters, family members, hallway noise. Then comes note generation, where the system can produce text that sounds polished and still gets the medicine wrong. That is the dangerous kind of wrong.
The most common architectural mistake in this stack is also the laziest: sending the raw transcript straight into the note-generation model. That shortcut produces output that feels fluent but cannot be audited cleanly. It may invent details that were never discussed, miss clinically relevant facts phrased casually, or flatten nuance that matters. In gen AI in healthcare, fluency is cheap. Reliability is not. The entity-extraction layer is the point where the pipeline stops being a demo and starts behaving like software you could actually trust.
Audio Capture and ASR — The Foundation Layer
Every ambient scribe product lives or dies on its capture and transcription layer. Clinical environments are messy: overlapping speech, accents, whispers, monitor noise, hallway chatter, and enough specialized terminology to make generic speech models look confident and wrong at the same time.
Audio Capture Architecture
For many exam-room workflows, a mobile device microphone is good enough. A tablet or phone placed in the room can capture intelligible speech without exotic hardware, provided the placement is consistent and the room is not acoustically cursed.
For real-time transcription, WebRTC is the practical default because it already handles audio streaming, echo control, and basic noise cancellation better than most teams can build from scratch.
A few implementation choices matter more than teams expect:
- Stream audio in chunks every few seconds if you want live transcript feedback during the visit
- Reprocess the full encounter after the visit if final-note accuracy matters more than flashy live output
- Use 16-bit PCM at 16 kHz as a common baseline for speech capture, then compress with Opus or AAC for transport
- Normalize across device types early, because room mics, tablets, and desktops will not behave the same way
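The chunking choice above is mostly byte arithmetic. A minimal sketch, assuming mono 16-bit PCM at 16 kHz: each second of audio is 32,000 bytes, so a streaming buffer just slices on sample-aligned boundaries before handing segments to the ASR layer.

```python
SAMPLE_RATE = 16_000   # Hz, common speech-capture baseline
BYTES_PER_SAMPLE = 2   # 16-bit PCM, mono

def chunk_pcm(pcm: bytes, chunk_seconds: float = 3.0):
    """Split a mono 16-bit PCM buffer into fixed-duration chunks for streaming ASR.

    Yields byte slices aligned to sample boundaries; the final partial chunk
    is flushed so no trailing audio is dropped.
    """
    chunk_bytes = int(SAMPLE_RATE * chunk_seconds) * BYTES_PER_SAMPLE
    for offset in range(0, len(pcm), chunk_bytes):
        yield pcm[offset:offset + chunk_bytes]

# Seven seconds of silence as stand-in audio: 16,000 samples * 2 bytes per second
one_second = bytes(SAMPLE_RATE * BYTES_PER_SAMPLE)
chunks = list(chunk_pcm(one_second * 7, chunk_seconds=3.0))
```

The same math is why device normalization matters: a 44.1 kHz stereo tablet stream has to be downmixed and resampled before this arithmetic holds.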
ASR Selection — The Medical Vocabulary Problem
This is where the foundation either holds or quietly embarrasses you. General-purpose speech recognition can work for demos, but production-grade medical ASR needs stronger clinical vocabulary handling, drug-name recognition, and better medical terminology recognition than generic models usually provide.
The current shortlist is familiar:
- Deepgram: enterprise speech stack with healthcare positioning and BAA support for eligible customers
- AssemblyAI: offers a BAA and explicitly markets medical transcription and healthcare workflows
- Azure Speech Services: Microsoft supports HIPAA/HITECH compliance across in-scope services under its BAA framework
- Amazon Transcribe Medical: purpose-built medical transcription, with streaming and batch modes, and listed as HIPAA-eligible
- Whisper: open-source model that can be self-hosted to keep PHI entirely within your own infrastructure. OpenAI’s hosted API endpoint can also handle PHI, but only after signing a BAA with OpenAI and enabling zero data retention (ZDR) at the organization level before any PHI is processed
The real builder warning is not “pick vendor X.” It is this: do not assume that a strong demo transcript equals production readiness. In healthcare, the ASR layer is not just a convenience feature. It is the part that decides whether the rest of the stack is working with signal or just polished nonsense.
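One concrete way to separate demo polish from production readiness is to score each candidate vendor on a specialty-specific test set. Word error rate (WER) is the standard metric: edit distance over words divided by reference length. A minimal scorer, pure stdlib:

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / reference word count."""
    ref, hyp = reference.lower().split(), hypothesis.lower().split()
    # Standard dynamic-programming edit distance over words.
    d = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        d[i][0] = i
    for j in range(len(hyp) + 1):
        d[0][j] = j
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            cost = 0 if ref[i - 1] == hyp[j - 1] else 1
            d[i][j] = min(d[i - 1][j] + 1,         # deletion
                          d[i][j - 1] + 1,         # insertion
                          d[i - 1][j - 1] + cost)  # substitution or match
    return d[len(ref)][len(hyp)] / max(len(ref), 1)
```

Note what aggregate WER hides: a hypothesis that turns "metoprolol 50 mg" into "metoprolol 15 mg" scores only two word errors, which is why drug names and dosages deserve their own targeted error counts on top of the headline number.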
Evaluating ASR providers for a clinical ambient scribe? Ask Topflight about benchmark data on medical vocabulary accuracy and latency across major providers in clinical environments.
Speaker Diarization — The Underrated Hard Problem
Speaker diarization — figuring out who said what — is the part ambient scribe teams chronically underestimate. Demos usually show a clean two-person exchange in a quiet room. Real clinics are less polite.
When diarization fails, note accuracy falls apart fast. A physician statement lands in the patient history. A patient symptom gets misread as the clinician’s assessment. Once transcript lines are misattributed, no amount of clever prompting or downstream cleanup will reliably fix it. The model is now building on bad input, and garbage remains faithful to its traditions.
Diarization Architecture Options
There are a few realistic paths:
- AssemblyAI or Deepgram diarization: easiest to integrate, good enough for cleaner two-speaker audio, less reliable once the room gets messy
- pyannote.audio: open-source, self-hostable, and designed for speaker diarization pipelines that can be fine-tuned to your own data
- NVIDIA NeMo: more infrastructure-heavy, but built for serious diarization workflows and noisy audio environments
- Directional microphone arrays: higher hardware and setup cost, but often worth it when physician review quality matters more than keeping the BOM pretty
The practical takeaway is simple: if you care about production-grade notes, do not rely on post-hoc diarization of one messy mixed stream unless you enjoy debugging ghosts. A stronger setup is to separate channels as early as possible, then merge labeled streams later. More expensive, yes. Also much closer to something a clinician would trust twice.
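Here is why early channel separation pays off: once each stream is already speaker-attributed at capture time, the merge step is a timestamp sort rather than a voice-clustering problem. A sketch, assuming each channel has already been transcribed with utterance timings (all names here are illustrative):

```python
from dataclasses import dataclass

@dataclass
class Utterance:
    start: float   # seconds from encounter start
    end: float
    speaker: str
    text: str

def merge_channels(*channels):
    """Merge per-speaker utterance lists into one timeline-ordered transcript.

    Because each channel is speaker-attributed at capture time, the merge is a
    stable sort by start time instead of post-hoc clustering of mixed audio.
    """
    return sorted((u for ch in channels for u in ch), key=lambda u: u.start)

physician = [Utterance(0.0, 2.1, "physician", "What brings you in today?"),
             Utterance(6.0, 7.5, "physician", "How long has that been going on?")]
patient = [Utterance(2.4, 5.8, "patient", "Chest tightness when I climb stairs."),
           Utterance(7.9, 9.0, "patient", "About two weeks.")]
timeline = merge_channels(physician, patient)
```

Overlapping speech still needs handling (two utterances can interleave), but misattribution, the failure mode that actually poisons the note, is largely designed out.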
LLM Note Generation — Prompt Engineering for Clinical Accuracy
This is the stage where most teams pour in the love, and where the biggest mistakes get dressed up in fluent prose. In clinical note generation AI development, the real risk is not awkward wording. It is a note that sounds polished while quietly inventing facts.
The Two-Step Architecture: Extract First, Write Second
Do not generate notes directly from raw transcripts. The safer pattern is a two-step pipeline.
First, use an LLM or named entity recognition (NER) layer to extract structured facts from the transcript: chief complaint, HPI elements, medications, findings, diagnoses, plan items, and follow-up instructions. Output that as structured data tied to source transcript spans.
Second, generate the note from that structured payload, not from the raw conversation. That makes hallucination easier to control because every line in the output can be traced back to extracted evidence. It also gives clinicians a faster validation checkpoint before final note creation.
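The linchpin of this two-step pattern is that every extracted fact carries a pointer back to its transcript evidence. A minimal sketch of that contract, with hypothetical names (`ExtractedFact`, `render_section`): the renderer refuses to emit any note line whose source span is empty.

```python
from dataclasses import dataclass

@dataclass
class ExtractedFact:
    category: str        # e.g. "medication", "symptom", "plan"
    value: str           # normalized clinical content
    source_span: tuple   # (start_char, end_char) in the speaker-labeled transcript
    confidence: float    # surfaced later in the physician review UI

def render_section(facts, category, transcript):
    """Emit note lines only for facts whose span actually exists in the transcript."""
    lines = []
    for f in facts:
        if f.category != category:
            continue
        lo, hi = f.source_span
        if transcript[lo:hi]:  # evidence must be present, or the fact is dropped
            lines.append(f'- {f.value} (source: "{transcript[lo:hi]}")')
    return lines

transcript = "patient: I've been taking lisinopril ten milligrams every morning"
facts = [ExtractedFact("medication", "lisinopril 10 mg PO daily", (26, 65), 0.94)]
section = render_section(facts, "medication", transcript)
```

In a real product the spans would come from the extraction model itself, and the same structure powers the side-by-side transcript tracing in the physician review queue.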
Prompt Engineering for Clinical Notes
Good prompt engineering here is less about sounding clever and more about reducing variance. The prompt should define the specialty, note format, required sections, EHR formatting constraints, and one hard rule: use only facts supported by the extracted data. If information is incomplete, mark it as incomplete instead of improvising.
Low-temperature generation usually works best because clinicians do not want creative variations of the same visit note. They want stable output. Specialty-specific templates matter too. A psychiatric SOAP note and an urgent care note do not just differ cosmetically; they encode different clinical context and documentation expectations.
You can also use retrieval-augmented generation (RAG) or tightly scoped reference material to inject local templates and specialty rules. Save LLM fine-tuning for cases where prompting alone stops carrying the load.
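Putting those constraints together, the prompt itself can be assembled deterministically. This is a hedged sketch of the shape, not a production prompt: the function name, wording, and JSON payload are all illustrative, and the model never sees the raw transcript, only the structured extraction output.

```python
def build_note_messages(specialty, note_format, entities_json):
    """Assemble a chat-style prompt for note generation from extracted entities.

    The system message pins specialty, format, and the one hard rule; the user
    message carries only the structured payload, never the raw conversation.
    """
    system = (
        f"You draft {note_format} notes for {specialty} encounters. "
        "Use ONLY facts present in the provided structured data. "
        "If a required section has no supporting facts, write 'Not discussed' "
        "instead of inventing content. Output plain text, one section per heading."
    )
    user = f"Structured encounter data:\n{entities_json}\n\nDraft the note."
    return [{"role": "system", "content": system},
            {"role": "user", "content": user}]

messages = build_note_messages(
    specialty="urgent care",
    note_format="SOAP",
    entities_json='{"chief_complaint": "ankle pain", "plan": ["x-ray", "RICE"]}',
)
# Send with a low temperature (e.g. 0.1) to whichever BAA-covered model API you use.
```

Versioning this prompt-builder alongside your templates is what makes output regressions diagnosable later.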
Model Selection
Today’s practical options are familiar: GPT-4 via OpenAI’s API for eligible healthcare customers under a BAA, Claude API through Anthropic’s HIPAA-ready offering, Azure OpenAI for teams already standardized on Microsoft, and self-hosted open models such as Llama or Mistral when PHI control and cost matter more than frontier reasoning quality.
The builder mistake to avoid is one-shot note generation from raw transcripts. That is how teams get notes that look finished, pass a casual skim, and still misstate the medicine. Also known as the fastest route to regrettable demos.
Teams considering GPT-4 for clinical notes should also review our guide to ChatGPT HIPAA compliance before sending any PHI through an external model API.
HIPAA Compliance for Ambient AI Scribes — Audio PHI Is Different
An ambient AI documentation tool does not just process text. It captures a clinical conversation in real time, which means it is dealing with protected health information (PHI) from the moment the encounter is recorded.
Under HIPAA, individually identifiable health information is protected in any form or medium, including oral information. In plain English: if a recording can identify the patient and contains health information, treat it as PHI.
What Audio PHI Requires Under HIPAA
The first practical consequence of handling HIPAA audio PHI is contractual, not glamorous. Every vendor that stores, transmits, or processes that recording may need a business associate agreement (BAA) in place:
- cloud hosting
- audio storage
- ASR providers
- any LLM service that receives transcripts or extracted entities
AWS, Microsoft, OpenAI, Anthropic, Daily, and LiveKit all publicly describe HIPAA/BAA pathways for eligible healthcare use cases, but you still need to confirm the exact service, configuration, and contract scope. That is why AI in healthcare compliance is mostly about architecture discipline, not checkbox theater.
The second consequence is lifecycle control. Audio data retention should be limited to what is operationally necessary for note generation, QA, and dispute resolution, then deleted automatically according to a documented policy.
HIPAA’s “minimum necessary” standard does not hand you a magic retention number, but it absolutely punishes vague “we’ll keep it just in case” thinking. Encryption in transit and at rest is table stakes, and SOC 2 is helpful, but neither replaces a real HIPAA design review.
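A documented retention policy only counts if something enforces it. A minimal sketch of mechanical enforcement: per-artifact-type windows checked against creation timestamps. The specific durations below are illustrative placeholders, not legal guidance; your actual windows come from your own compliance review.

```python
from datetime import datetime, timedelta, timezone

RETENTION = {                      # illustrative windows, not legal guidance
    "audio": timedelta(days=7),    # delete once the note is finalized and QA'd
    "transcript": timedelta(days=30),
    "draft_note": timedelta(days=365),
}

def is_expired(artifact_type, created_at, now=None):
    """True when an artifact has outlived its documented retention window."""
    now = now or datetime.now(timezone.utc)
    return now - created_at > RETENTION[artifact_type]

def purge(artifacts, now=None):
    """Return (kept, deleted_ids); deletions should also land in the audit log."""
    kept, deleted_ids = [], []
    for a in artifacts:
        if is_expired(a["type"], a["created_at"], now):
            deleted_ids.append(a["id"])
        else:
            kept.append(a)
    return kept, deleted_ids
```

Run this on a schedule, log every deletion, and "we'll keep it just in case" stops being your default posture.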
For deeper platform-level patterns, this is the same discipline you would expect in HIPAA compliant software development or HIPAA compliant app development.
The Consent UX Problem
Patient consent is both a legal and product problem. Patients should understand what is being recorded, why, who can access it, and how long it is retained. The flow should be clear, revocable, and logged with versioning and timestamps. If the consent screen reads like a hostage note from Legal, adoption drops. If it is too casual, your compliance posture starts to smell optimistic.
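"Revocable and logged with versioning and timestamps" has a simple data shape. A sketch, with hypothetical names: each consent action is an immutable event tied to the exact policy version the patient saw, and the current consent state is whatever the latest event says.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone

@dataclass
class ConsentEvent:
    patient_id: str
    action: str              # "granted" or "revoked"
    policy_version: str      # ties the event to the exact consent text shown
    recorded_at: datetime

@dataclass
class ConsentLog:
    events: list = field(default_factory=list)

    def record(self, patient_id, action, policy_version):
        self.events.append(ConsentEvent(
            patient_id, action, policy_version,
            datetime.now(timezone.utc)))

    def is_consented(self, patient_id):
        """Latest event wins: consent is revocable at any point in the encounter."""
        latest = None
        for e in self.events:
            if e.patient_id == patient_id:
                latest = e
        return latest is not None and latest.action == "granted"
```

The capture pipeline should check `is_consented` before recording starts and again if revocation arrives mid-encounter.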
The BAA Chain Teams Miss
The easiest miss in ambient scribing is the media layer. If your WebRTC stack falls back to a third-party TURN relay, that relay may be handling PHI. Daily explicitly offers HIPAA support, and LiveKit publishes HIPAA-eligible services, which is exactly why teams need to inspect the full path rather than just the app UI.
EHR Integration — Writing Notes Back Where Physicians Live
A medical scribe AI app that produces a note outside the chart saves less time than its demo suggests. If the clinician has to copy, paste, reformat, and reattach context manually, the product is no longer helping the clinical documentation workflow. It is just moving the mess around.
Real EHR integration means getting the note back into the system where the physician already works.
FHIR Resources for Clinical Note Write-Back
At the data layer, FHIR R4 gives you the right building blocks.
- FHIR DocumentReference is the practical starting point for many generated notes because it can carry document metadata, author, status, and the link to the encounter.
- FHIR Composition is better when you need a richer multi-section structure such as an H&P or discharge summary.
- Encounter ties the note to the specific visit.
- DiagnosticReport may matter in specialties where the generated output includes interpretation or diagnostic content.
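For the common DocumentReference path, the write-back payload is compact. A sketch of a minimal FHIR R4 resource carrying a draft note as inline base64 text; the LOINC code 11506-3 (Progress note) is one reasonable default, and the IDs are placeholders. Field requirements vary by EHR, so treat this as a starting shape, not a certified mapping.

```python
import base64
import json

def draft_note_document_reference(note_text, patient_id, encounter_id, practitioner_id):
    """Build a FHIR R4 DocumentReference carrying a draft note as inline base64 text.

    docStatus stays 'preliminary' until the physician approves; flipping it to
    'final' should only happen from the review queue, never automatically.
    """
    return {
        "resourceType": "DocumentReference",
        "status": "current",
        "docStatus": "preliminary",
        "type": {"coding": [{"system": "http://loinc.org",
                             "code": "11506-3",
                             "display": "Progress note"}]},
        "subject": {"reference": f"Patient/{patient_id}"},
        "author": [{"reference": f"Practitioner/{practitioner_id}"}],
        "context": {"encounter": [{"reference": f"Encounter/{encounter_id}"}]},
        "content": [{"attachment": {
            "contentType": "text/plain",
            "data": base64.b64encode(note_text.encode("utf-8")).decode("ascii"),
        }}],
    }

payload = draft_note_document_reference("S: ...\nO: ...\nA: ...\nP: ...",
                                        "123", "456", "789")
body = json.dumps(payload)  # POST to {fhir_base}/DocumentReference with proper auth
```

The `context.encounter` reference is what keeps the note attached to the specific visit rather than floating loose in the chart.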
SMART on FHIR for Embedded Launch
SMART on FHIR is what makes the app feel native. It lets the scribe launch from the patient chart with patient- and encounter-level context already available, instead of forcing a separate login and manual patient selection.
That is the path of least resistance for adoption, whether you are dealing with Epic integration, Cerner integration, or smaller EHR vendors such as Athenahealth that support modern API-based workflows.
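Mechanically, an EHR launch hands the app an `iss` (the FHIR base URL) and a `launch` token, and the app echoes both back in the authorization request. A sketch of building that request; the endpoint and client details are placeholders, and exact scope strings vary by EHR and SMART version (v2 uses granular `.c`/`.u`/`.d` scopes instead of `.write`).

```python
from urllib.parse import urlencode

def smart_authorize_url(authorize_endpoint, iss, launch_token,
                        client_id, redirect_uri, state):
    """Build the authorization request for a SMART on FHIR EHR launch.

    Echoing back `launch` and `aud` (the iss) is what gives the app patient-
    and encounter-level context without a separate login or patient search.
    """
    params = {
        "response_type": "code",
        "client_id": client_id,
        "redirect_uri": redirect_uri,
        "launch": launch_token,
        "scope": "launch openid fhirUser patient/DocumentReference.write",
        "state": state,
        "aud": iss,
    }
    return f"{authorize_endpoint}?{urlencode(params)}"

url = smart_authorize_url(
    "https://ehr.example.com/oauth2/authorize",   # from the server's SMART config
    "https://ehr.example.com/fhir",               # iss passed by the EHR at launch
    "xyz123", "my-scribe-app",
    "https://scribe.example.com/callback", "opaque-state")
```

After the redirect, the token response carries `patient` and (for encounter launches) `encounter` context the scribe can use immediately.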
For Epic-specific patterns, see our guide on how to integrate with Epic EHR. For the broader strategic angle, this ties directly into How will AI help change EHR? and even operational downstream topics like EHR in medical billing.
Physician Review Queue Design
The physician review queue is where trust is either built or quietly lost. The review experience should:
- allow inline editing of the note
- support one-click note approval
- show which transcript spans support each generated claim
Side-by-side transcript tracing matters more than clever UI polish. If the doctor cannot quickly see why the system wrote a sentence, EHR write-back becomes a liability instead of a time-saver.
The Build-vs-Buy Decision Matrix — What to Build, What to Use as API
Nobody serious tries to build ambient AI scribe infrastructure entirely from scratch. The smarter question is where custom work creates real product leverage and where it just burns runway while your competitors ship. For most teams, the answer looks like this:
| Capability | Build Argument | Buy / Use API Argument | Recommendation |
| --- | --- | --- | --- |
| Audio capture and streaming | Full control over device behavior and potential hardware IP | WebRTC is messy; platforms like Daily and LiveKit already handle device compatibility and media plumbing | Buy API |
| Medical ASR | Model ownership, tighter PHI control, lower marginal cost at scale | Providers like Deepgram and AssemblyAI already offer healthcare-ready speech workflows and BAA paths | Buy API |
| Speaker diarization | Potential edge in noisy clinical environments | Off-the-shelf or open-source options are usually enough early on | Usually buy |
| Clinical NLP / entity extraction | Specialty ontology can become a moat | GPT-4- or Claude-class models can extract a lot with good prompting | Depends |
| Note generation LLM | Lower cost and tighter control with self-hosting | Frontier models still outperform smaller custom stacks on messy reasoning | Usually buy API |
| Note template library | This is where specialty differentiation lives | No off-the-shelf product really solves it for you | Build |
| Physician review UI | Core workflow and trust layer | Also not something you should outsource if you want users to stick | Build |
| FHIR write-back / EHR integration | More control over mappings and target systems | Health Gorilla and Particle Health both position themselves as interoperability infrastructure layers with FHIR APIs | Usually buy API |
| HIPAA compliance infrastructure | Maximum control, maybe lower long-term cost | Aptible explicitly positions itself as HIPAA infrastructure for digital health teams | Usually buy |
The highest-leverage build bets are not glamorous infrastructure trophies. They are the physician review interface and the specialty-specific note template library.
That is where generalist players like DeepScribe and Nabla are naturally constrained by broad product architecture, and where a narrower competitor can actually feel better for a specific workflow.
In other words, do not waste your team’s best engineers rebuilding plumbing unless your strategy is to become a plumbing company.
Specialty-Specific Requirements — Where Generalist Products Fail
Most ambient scribe platforms are optimized for general ambulatory workflows. That is where documentation volume is highest, and where the market first felt the pain of physician burnout. But that also creates the category’s blind spot: what works in primary care often breaks once the workflow gets more specialized.
In AI medical scribe software development, the real moat usually comes from specialty-specific documentation, not from squeezing out another benchmark slide.
Mental Health Documentation
Mental health notes are easy to get wrong in ways that still look polished.
- A generic SOAP structure may miss core elements like the Mental Status Exam
- PHQ-9 and GAD-7 scores need to be structured and tied to the treatment plan
- Therapy visits and medication-management visits should not use the same template
- Confidentiality rules and mandatory-reporting issues make documentation logic more sensitive than in standard ambulatory care
That is also why teams building for behavioral health should take therapy chatbot compliance seriously instead of treating it like legal wallpaper.
Emergency Medicine Documentation
Emergency medicine is where neat ambient assumptions go to die.
- Encounters are noisy, interrupted, and often involve multiple simultaneous speakers
- Directional room-level capture usually matters more than a simple device mic
- Notes often need to be ready within minutes, not after leisurely batch processing
- Since ED E/M coding now centers on medical decision-making, the scribe must capture reasoning, not just symptoms and disposition
Surgical and Procedural Documentation
Procedural workflows are even less forgiving.
- Notes need required elements such as indication, consent, technique, findings, complications, and post-procedure assessment
- Ambient capture alone is usually not enough; structured dictation support often still matters
- Procedure-specific templates are critical because surgeons repeat patterns, not generic visit structures
- Even the after-visit summary may need specialty-specific logic when follow-up instructions carry medical-legal weight
That is the recurring pattern across specialties: the model is only part of the product. The template, workflow, and review logic are where generalist tools start to feel generic in the bad sense of the word.
FDA Classification for Ambient AI Scribes
This is the question nearly every ambient scribe builder asks, and many answer with more confidence than accuracy. A standard ambient AI scribe that transcribes a visit and generates a draft note for clinician review is often positioned as FDA non-device software, provided it stays on the documentation side of the line rather than drifting into diagnosis or treatment support.
The legal backdrop is the Cures Act’s software exclusions and the FDA’s current CDS framework.
That safer posture depends on a few conditions:
- the output is a draft for physician review and approval
- the software documents what happened; it does not recommend what to do next
- the function supports workflow and recordkeeping rather than clinical decision-making
The line gets crossed when the product starts adding diagnoses, recommending plan items, surfacing “next best actions,” or generating alerts or scores that influence care. FDA’s current guidance is explicit that software analyzing patient-specific information to detect life-threatening conditions or support time-critical decisions can be a device function.
That is where the clinical decision support exemption starts to disappear fast, whether the output is packaged as a suggestion, an alert, or something delivered through CDS Hooks.
That is also why “AI-suggested additions” is regulatory napalm in a polo shirt. For the broader regulatory picture, see our guide to health AI FDA clearance.
The Ambient AI Scribe Build Checklist
Before deploying in a clinical environment with real patient encounters, confirm the following.
Audio and ASR
- Medical ASR provider selected, with BAA available and executed
- Audio capture tested in the target clinical environment, not in a suspiciously quiet demo setup
- Speaker diarization tested with same-gender speakers, overlapping speech, and three or more speakers
- ASR word error rate validated against specialty-specific vocabulary
- Audio retention policy documented, with automated deletion implemented
HIPAA Compliance
- BAA chain complete across hosting, ASR, LLM, storage, and WebRTC/TURN infrastructure
- Patient consent flow implemented so it is transparent, revocable, and logged
- Audio PHI data flow documented end to end: capture → ASR → entity extraction → note generation → EHR write-back → deletion
- Minimum necessary principle enforced, with audio deleted after note generation unless a specific retention need is documented
- Audit logging enabled for transcript access, note access, and physician review events
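The audit-logging item above is worth a sketch, because "append-only" is a property you can actually enforce in code rather than just assert in a policy document. One illustrative approach (names hypothetical): hash-chain each entry to the previous one so after-the-fact edits are detectable.

```python
import hashlib
import json
from datetime import datetime, timezone

class AuditLog:
    """Append-only audit trail; each entry hashes the previous so tampering is detectable."""

    def __init__(self):
        self.entries = []
        self._prev_hash = "0" * 64

    def record(self, actor, action, resource):
        entry = {
            "ts": datetime.now(timezone.utc).isoformat(),
            "actor": actor,        # e.g. "dr.smith"
            "action": action,      # e.g. "note.viewed", "note.approved"
            "resource": resource,  # e.g. "DocumentReference/789"
            "prev": self._prev_hash,
        }
        self._prev_hash = hashlib.sha256(
            json.dumps(entry, sort_keys=True).encode()).hexdigest()
        entry["hash"] = self._prev_hash
        self.entries.append(entry)

    def verify(self):
        """Recompute the chain; False means an entry was altered or removed."""
        prev = "0" * 64
        for e in self.entries:
            if e["prev"] != prev:
                return False
            body = {k: v for k, v in e.items() if k != "hash"}
            prev = hashlib.sha256(json.dumps(body, sort_keys=True).encode()).hexdigest()
            if e["hash"] != prev:
                return False
        return True
```

In production these entries would stream to write-once storage; the in-memory list here just demonstrates the chaining contract.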
Note Quality and Safety
- Two-step architecture in place: entity extraction first, note generation second
- Hallucination testing completed on a meaningful set of real-world encounters
- Physician review queue designed for fast approval without sacrificing traceability
- Entity-level confidence indicators surfaced in the review UI
- Every generated note element traceable back to a source transcript segment
EHR Integration
- FHIR DocumentReference or Composition mapped for the target EHR workflow
- SMART on FHIR launch context implemented if the app is embedded in the chart
- Draft → reviewed → final note status mapped cleanly to the EHR lifecycle
- Epic App Orchard or equivalent review process initiated where required
Why Choose Topflight for Ambient AI Scribe Development
Topflight builds regulated clinical AI products for teams that need more than a polished prototype. We work across the full stack that ambient scribes actually depend on:
- audio capture and WebRTC pipelines
- medical ASR
- speaker diarization
- LLM-driven note generation
- FHIR-based EHR write-back
- the HIPAA controls required to move from demo to deployment
In short, Topflight builds for healthcare workflows where HIPAA, EHR integration, and AI behavior all have to hold up in production, not just in a demo.
That matters because ambient scribing is not one feature. It is a chain of brittle systems that all have to work together under clinical and compliance pressure. Our work on GaleAI reflects that kind of complexity: Topflight helped build an AI-powered medical coding platform with EHR and medical API integration that reportedly cut coding time by 97% and identified $1.14M in yearly lost revenue.
If you are evaluating ambient AI scribe app development, the real question is not whether a model can draft a note. It is whether your product can survive real workflows, real PHI, and real clinicians. That is the part we build for.
Frequently Asked Questions
How does an ambient AI scribe work technically?
It captures encounter audio, transcribes it, separates speakers, extracts clinical facts, and turns those facts into a draft note for clinician review. In mature products, the note is then written back into the EHR rather than left in a separate app.
What is the best ASR model for medical ambient scribing?
There is no universal winner. The best choice depends on your specialty vocabulary, latency target, diarization quality, hosting model, and BAA terms.
Is an ambient AI scribe a medical device under FDA regulations?
Usually not if it only creates a draft note for clinician review and does not make diagnostic or treatment recommendations. It starts looking more like device software when it adds patient-specific recommendations, alerts, or risk scoring.
Is audio of a clinical encounter considered PHI under HIPAA?
Yes, if the recording contains individually identifiable health information. HIPAA protects health information in any form or medium, including oral communications and recordings derived from them.
Does my ambient scribe need a BAA with OpenAI or Anthropic?
Yes, if you send PHI to those vendors, you need the right healthcare/BAA arrangement in place for eligible services. OpenAI and Anthropic both now describe HIPAA-ready or BAA-supported paths for healthcare use.