xuezh (Chinese Learning Engine, ZFC / Unix-style)

Name: xuezh is short for 学中文 (learn Chinese).

This repo is a local learning engine for Mandarin study. It is designed to be used as a tool/skill behind a bot runtime + SOTA LLM (Clawdbot is the recommended integration), but it is also a plain CLI you can call however you want.

Authorship

Primary author: Codex using the gpt-5.2-codex model.

Recommended usage (Clawdbot)

Recommended: run xuezh as a CLI tool from a bot agent (Clawdbot) and parse JSON outputs.
Use a config file for credentials/behavior, and keep dependencies pinned in the bot's dev environment.
You will need an Azure Speech key + region (free tier is fine to start).

Clawdbot (upstream) repo:

https://github.com/steipete/clawdbot

Clawdbot is a local-first personal assistant that routes WhatsApp/Telegram/WebChat messages to an agent runtime. The Gateway is the control plane (sessions, providers, media, voice wake), while tools like xuezh are called on demand. Integration is simple: have the bot call the xuezh CLI and parse JSON responses, then surface the feedback back to the user. You can run the bot locally on your devices and keep all state under your control.

Key interaction flows (bot ↔ user):

Pronunciation feedback (audio → assessment)
User sends a voice note (e.g., “你好我叫小李。你叫什么？”).
Bot calls:
```
xuezh audio process-voice --in /tmp/voice.m4a --ref-text "你好我叫小李。你叫什么？" --json
```
Bot reads data.assessment + data.transcript and responds with targeted feedback.
Listen and repeat (text → audio)
User asks “How do I say …?”
Bot calls:
```
xuezh audio tts --text "你好" --json
```
Bot returns the audio_tts artifact as a voice note.
Progress recap (facts → summary)
User asks “How am I doing?”
Bot calls:
```
xuezh report hsk --level 6 --json
```
Bot summarizes the factual progress data. Use --level 7-9 if your dataset includes the 7–9 bucket.

Example screenshots (from a bot flow):

Example config (~/.config/xuezh/config.toml):

[azure.speech]
key_file = "/run/agenix/xuezh-azure-speech-key"
region = "westeurope"

[audio]
backend_global = "azure.speech"
process_voice_backend = "azure.speech"
convert_backend = "ffmpeg"
tts_backend = "edge-tts"
inline_max_bytes = 200000

Other usage (CLI)

This is a standard CLI. You can call it from any script or workflow as long as its dependencies are available:

nix run github:joshp123/xuezh
xuezh version --json
xuezh audio process-voice --in /path/to/voice.m4a --ref-text "你好" --json

Core commands + example outputs

Version:

$ xuezh version --json
{"ok":true,"schema_version":"1.0","command":"version","data":{"version":"0.1.0"},"artifacts":[],"truncated":false,"limits":{}}

Voice processing (pronunciation assessment + transcript):

$ xuezh audio process-voice --in /path/to/voice.m4a --ref-text "你好" --json
{"ok":true,"schema_version":"1.0","command":"audio.process-voice","data":{"assessment":{...},"transcript":{"text":"你好"}},"artifacts":[...],"truncated":false,"limits":{"inline_bytes_max":200000}}

Text-to-speech (audio artifact):

$ xuezh audio tts --text "你好" --json
{"ok":true,"schema_version":"1.0","command":"audio.tts","data":{"voice":"zh-CN-XiaoxiaoNeural"},"artifacts":[{"purpose":"audio_tts","path":"artifacts/audio/tts/....wav"}],"truncated":false,"limits":{}}

SRS review (recall vs pronunciation):

$ xuezh review start --json
$ xuezh review grade --item w_xxx --recall 4 --pronunciation 2 --json

Notes:

review start returns separate recall_items and pronunciation_items queues.
The --grade flag applies to recall only.

Key idea

Model = smart endpoint (lesson planning, choosing what to teach next, pedagogy)
Engine = dumb pipes (SQLite persistence, mechanical transforms, bounded reports, audio file materialization)

The engine must remain ZFC-compliant: no local ranking/selection heuristics; no “what should we do next” logic. The engine only returns primary sources and performs mechanical transforms. See docs/reference/zfc-zero-framework-cognition.md.

Audio pipeline architecture (STT/TTS)

Input normalization: all audio is normalized to WAV via ffmpeg before any backend call.
STT / assessment: audio.process-voice runs STT + pronunciation assessment (default backend is Azure Speech).
Artifacts: full raw outputs are stored as artifacts; the CLI response inlines only the actionable subset.
TTS: audio.tts uses edge-tts to materialize voice audio into an artifact.
Local fallback: whisper provides a local STT path when Azure isn't used.

Runtime dependencies

ffmpeg (audio conversion)
edge-tts (TTS voice)
whisper (local STT fallback; Azure is default)
Azure Speech SDK + credentials for pronunciation assessment

Azure notes:

You need an Azure Speech resource key/region (free F0 tier is fine to start).
Quick setup:
1. Create an Azure Speech resource (region westeurope is fine).
2. Grab the key + region from the Azure portal.
3. Put the key in your config file ([azure.speech] key_file = ...) and set region.
4. Run xuezh audio process-voice --in /path/to/audio.m4a --ref-text "你好" --json.
Free tier includes 5 audio hours/month for Speech to Text and 0.5M Neural TTS characters/month.
Pronunciation Assessment is billed at the baseline Speech to Text rate; prosody/grammar/vocabulary/topic are add-on charges.

Quick start (developer)

Enter the dev environment:
```
devenv shell
```
(Optional) Install the package in editable mode:
```
python -m pip install -e .[dev]
```
Run the CLI:
```
xuezh --help
xuezh version --json
```
Run tests:
```
pytest
```

Default dataset seed (HSK)

The repo bundles a pinned snapshot of ivankra/hsk30 under datasets/ivankra-hsk30/. Use it to initialize real HSK coverage (vocab + grammar only; levels 1–6).

python scripts/seed_hsk30.py --source datasets/ivankra-hsk30

Verify the DB has coverage:

xuezh report hsk --level 6 --json

Notes:

Characters are not imported by default (v1 scope). Add --include-chars if needed.
The seed script filters to levels 1–6. If your dataset includes a 7–9 bucket, import those rows separately; reporting supports --level 7-9.
Set XUEZH_WORKSPACE_DIR=~/.clawdbot/workspace/xuezh in Clawdbot so the bot and CLI share the same DB.

What’s included

schemas/ : JSON Schemas (contract stubs; to be enforced by tickets)
tests/fixtures/ : minimal dataset fixtures
datasets/ : pinned upstream HSK snapshot for local seeding
src/xuezh/ : Python package + CLI skeleton (xuezh)
tickets/ : implementation tickets (Beads source of truth)
specs/ : user requirements, BDD scenarios, and testing pyramid strategy
skills/chinese-learning-orchestrator/ : the Skill prompt glue (SKILL.md + references)
devenv.nix : dev environment skeleton (use this; do not install via global package managers)
docs/handoff/ : handoff prompt for the implementing agent
infra/azure/speech/ : OpenTofu scaffold for the Azure Speech resource

Project boundaries (important)

This repo does not implement the Clawdbot bot runtime/Gateway or any Telegram/WhatsApp/WebChat integration.
This repo does not implement pedagogy, recommendations, or personalization logic (that stays in the model/agent).
The skill (skills/.../SKILL.md) teaches the model how to use the engine, and encodes learning best practices.

Workspace / data path

The engine stores data under:

~/.clawdbot/workspace/xuezh/

Override via environment variables:

XUEZH_WORKSPACE_DIR
XUEZH_DB_PATH

Ticket execution method

Work items live in Beads (not tickets/; those were scaffold placeholders).
Implement tickets using the RGR pattern:

Red: write/enable tests
Green: minimal implementation to pass tests
Refactor: clean up without behavior change

See AGENTS.md.

References

Authoritative CLI spec: docs/cli-contract.md
Documentation map: docs/README.md
Out of scope (v1): specs/out-of-scope.md
Authoritative specs: specs/id-scheme.md, specs/events.md, specs/artifacts/retention.md
CI-style checks: ./scripts/check.sh
Contract coverage enforcement: tests/contract/

Name		Name	Last commit message	Last commit date
Latest commit History 66 Commits
.beads		.beads
.github/workflows		.github/workflows
cmd/xuezh-go		cmd/xuezh-go
datasets/ivankra-hsk30		datasets/ivankra-hsk30
docs		docs
infra/azure/speech		infra/azure/speech
internal/xuezh		internal/xuezh
migrations		migrations
packaging/nix-overlay-example		packaging/nix-overlay-example
schemas		schemas
scripts		scripts
skills		skills
specs		specs
tickets		tickets
.envrc		.envrc
.gitattributes		.gitattributes
.gitignore		.gitignore
AGENTS.md		AGENTS.md
LICENSE		LICENSE
README.md		README.md
devenv.lock		devenv.lock
devenv.nix		devenv.nix
devenv.yaml		devenv.yaml
flake.lock		flake.lock
flake.nix		flake.nix
garnix.yaml		garnix.yaml
go.mod		go.mod
go.sum		go.sum

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

xuezh (Chinese Learning Engine, ZFC / Unix-style)

Authorship

Recommended usage (Clawdbot)

Other usage (CLI)

Core commands + example outputs

Key idea

Audio pipeline architecture (STT/TTS)

Runtime dependencies

Quick start (developer)

Default dataset seed (HSK)

What’s included

Project boundaries (important)

Workspace / data path

Ticket execution method

References

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 1

Languages

Folders and files

Latest commit

History

Repository files navigation

xuezh (Chinese Learning Engine, ZFC / Unix-style)

Authorship

Recommended usage (Clawdbot)

Other usage (CLI)

Core commands + example outputs

Key idea

Audio pipeline architecture (STT/TTS)

Runtime dependencies

Quick start (developer)

Default dataset seed (HSK)

What’s included

Project boundaries (important)

Workspace / data path

Ticket execution method

References

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 1

Languages

Packages