pdf2anki

A CLI tool that automatically generates high-quality Anki flashcards from PDF, text, and Markdown files using Claude AI.

Built as a hands-on project to explore what's possible when you combine LLM structured extraction, cognitive science (Wozniak's 20 Rules), and a real evaluation framework — not just "ask the AI and hope for the best."

Features

Quality Assurance Pipeline: 6-dimension confidence scoring + LLM critique for automatic improvement
Wozniak's 20 Rules of Knowledge Formulation: Card generation based on cognitive science
8 Card Types: QA, term definition, summary, cloze, reversible, sequence, compare & contrast, image occlusion
Bloom's Taxonomy: Every card is tagged with a cognitive level (remember through create)
Image-Aware Card Generation: Detect and extract images from PDFs, generate visual cards via Claude Vision API
Interactive Review TUI: Review, accept, reject, and edit cards in a terminal UI before export
Section-Aware Processing: Heading-based document splitting with breadcrumb context for better card quality
Extraction Cache: SHA-256 content hashing skips redundant extraction on repeated runs
Batch API: 50% cost reduction for non-urgent bulk processing
Prompt Evaluation Framework: Keyword-based matching with Recall/Precision/F1 metrics for prompt quality measurement
Cost Tracking: Per-session API cost monitoring with configurable budget limits
OCR Support: Optional OCR fallback for image-heavy PDFs (via ocrmypdf)

Installation

git clone https://github.com/shimo4228/pdf2anki.git
cd pdf2anki
uv sync --all-extras

Environment Setup

cp .env.example .env
# Edit .env and set your ANTHROPIC_API_KEY

Variable	Required	Default	Description
`ANTHROPIC_API_KEY`	Yes	-	Claude API key
`PDF2ANKI_MODEL`	No	`claude-sonnet-4-5-20250929`	Override Claude model
`PDF2ANKI_BUDGET_LIMIT`	No	`1.00`	API cost budget limit (USD)

Usage

Convert

# Basic conversion (PDF → TSV)
pdf2anki convert input.pdf

# Specify output file
pdf2anki convert input.pdf -o output.tsv

# JSON output
pdf2anki convert input.pdf --format json

# Both TSV and JSON
pdf2anki convert input.pdf --format both

# Full quality assurance pipeline
pdf2anki convert input.pdf --quality full

# Interactive review before export
pdf2anki convert input.pdf --review

# Enable image-aware card generation (Vision API)
pdf2anki convert input.pdf --vision

# Use extraction cache for faster repeated runs
pdf2anki convert input.pdf --cache

# Use Batch API for 50% cost reduction
pdf2anki convert input.pdf --batch

# Process entire directory
pdf2anki convert ./docs/

# Add custom tags and focus topics
pdf2anki convert input.pdf --tags "chapter1,important" --focus "machine learning"

# Limit card count and budget
pdf2anki convert input.pdf --max-cards 20 --budget-limit 0.50

# Enable OCR for image-heavy PDFs
pdf2anki convert input.pdf --ocr --lang jpn+eng

# Combine options
pdf2anki convert input.pdf --cache --vision --review --quality full --format both

Option	Default	Description
`-o, --output`	auto	Output file or directory
`--format`	`tsv`	Output format: `tsv`, `json`, `both`
`--quality`	`basic`	QA level: `off`, `basic`, `full`
`--model`	from config	Claude model name
`--max-cards`	`50`	Maximum cards to generate
`--tags`	-	Additional tags (comma-separated)
`--focus`	-	Focus topics (comma-separated)
`--card-types`	all 7	Card types to generate (comma-separated)
`--bloom-filter`	all	Bloom levels to include (comma-separated)
`--budget-limit`	`1.00`	Budget limit in USD
`--review`	off	Open interactive TUI for card review
`--vision`	off	Enable image-aware card generation
`--cache` / `--no-cache`	off	Enable extraction cache
`--batch`	off	Use Batch API (50% discount, async)
`--ocr`	off	Enable OCR
`--lang`	`jpn+eng`	OCR language
`--config`	`config.yaml`	Path to config YAML
`--verbose`	off	Debug logging

Preview

Dry-run text extraction without API calls.

pdf2anki preview input.pdf
pdf2anki preview input.pdf --ocr

Eval

Measure prompt quality against a labeled dataset.

# Run evaluation
pdf2anki eval --dataset evals/dataset.yaml

# Output JSON report
pdf2anki eval --dataset evals/dataset.yaml --output eval-report.json

Configuration

Settings are loaded with priority: env vars > config.yaml > defaults.

See config.yaml for all options including model, quality thresholds, card types, cost limits, cache, vision, and OCR settings.

Architecture

[Input] PDF / TXT / MD
  ↓
[Step 1] Text Extraction (pymupdf4llm + OCR fallback + cache)
  ↓
[Step 2] Section Splitting (heading-based with breadcrumb context)
  ↓
[Step 3] LLM Structured Extraction (Claude API + Vision API for images)
  ↓
[Step 4] Quality Assurance (6-dim Confidence Score → LLM Critique)
  ↓
[Step 5] Cross-Section Deduplication
  ↓
[Step 6] Interactive Review TUI (optional)
  ↓
[Step 7] Output (TSV / JSON)

Quality Pipeline

Cards are scored across 6 dimensions (front quality, back quality, card type fit, bloom level fit, tags quality, atomicity). Cards below the confidence threshold are sent through LLM critique for improvement, splitting, or removal.

Project Structure

src/pdf2anki/
  main.py        # CLI (typer): convert, preview, eval commands
  config.py      # YAML + env var config loader
  schemas.py     # Pydantic models (AnkiCard, ExtractionResult, etc.)
  extract.py     # Text extraction (pymupdf4llm + OCR)
  section.py     # Heading-based section splitting
  structure.py   # LLM structured card extraction
  prompts.py     # Wozniak-based prompt templates
  quality/       # Quality assurance pipeline (heuristic, duplicate, critique)
  convert.py     # TSV/JSON output conversion
  cost.py        # API cost tracking
  service.py     # Service layer orchestration
  cache.py       # SHA-256 extraction cache
  image.py       # PDF image detection and extraction
  vision.py      # Claude Vision API integration
  batch.py       # Batch API support
  tui/           # Interactive card review (Textual)
  eval/          # Prompt evaluation framework
tests/           # 624 tests, 92%+ coverage

Requirements

Python 3.12+
uv
Anthropic API key

Docs

Contributing Guide - Development workflow, testing, CLI reference
Runbook - Deployment, troubleshooting, common issues

License

AGPL-3.0 License - See LICENSE for details.

Name		Name	Last commit message	Last commit date
Latest commit History 43 Commits
docs		docs
evals		evals
src/pdf2anki		src/pdf2anki
tests		tests
.env.example		.env.example
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
.secrets.baseline		.secrets.baseline
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.ja.md		README.ja.md
README.md		README.md
config.yaml		config.yaml
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

pdf2anki

Features

Installation

Environment Setup

Usage

Convert

Preview

Eval

Configuration

Architecture

Quality Pipeline

Project Structure

Requirements

Docs

License

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

pdf2anki

Features

Installation

Environment Setup

Usage

Convert

Preview

Eval

Configuration

Architecture

Quality Pipeline

Project Structure

Requirements

Docs

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages