Skip to content

docs(server): drop phantom --talker-model + --code2wav-model rows#85

Merged
marksverdhei merged 1 commit into
htfrom
docs/server-readme-phantom-flags
Jun 12, 2026
Merged

docs(server): drop phantom --talker-model + --code2wav-model rows#85
marksverdhei merged 1 commit into
htfrom
docs/server-readme-phantom-flags

Conversation

@marksverdhei

Copy link
Copy Markdown

Summary

Epoch #81 task 5 (docs review on `tools/server/README.md`). The server-specific params table at lines 178-179 documented two flags that don't exist on this fork:

Phantom flag Help text claims Reality
`--talker-model FILE` "enables the /v1/audio/speech endpoint" No `talker` in `common/arg.cpp` or any C++ source; `/v1/audio/speech` is not a registered route (only `/v1/audio/transcriptions` exists via `routes.post_transcriptions_oai` in `server.cpp:198`)
`--code2wav-model FILE` "qwen3-omni code2wav gguf" No `code2wav` in arg.cpp or C++ source; only appears in `conversion/qwen3.py` as a tensor-name prefix in the GGUF converter, which is unrelated to runtime flags

The webui does reference `/v1/audio/speech` — but only in its TTS client code (calling out to an external OpenAI-compatible TTS server), not as something llama-server itself serves.

The real vocoder-related flags (`--model-vocoder`, `--tts-use-guide-tokens`) are still documented elsewhere and unchanged.

Test plan

  • Confirmed via grep that neither flag string exists in arg.cpp or any `.cpp`/`.h` outside conversion scripts.
  • Confirmed only `/v1/audio/transcriptions` is registered (no `/v1/audio/speech`).
  • Other 22 documented endpoints all match actual routes (audit pass).
  • (post-merge) markdown table renders cleanly on GitHub.

Audit scope

Spot-checked 6 documented defaults (`--ctx-checkpoints`, `--models-max`, `--cache-ram`, `--slot-prompt-similarity`, `--reasoning`, `--models-autoload`) — all match `common/common.h` values. So no broader sweep needed.

🤖 Generated with Claude Code

Epoch #81 task 5 (docs review). Both rows in the server-specific
params table reference qwen3-omni TTS support that does not exist:

* Neither --talker-model nor --code2wav-model is registered in
  common/arg.cpp; no C++ source mentions the strings "talker" or
  "code2wav" (only model-conversion code in conversion/qwen3.py
  references them as tensor name prefixes, which is unrelated).
* The /v1/audio/speech endpoint the help text promises is also
  absent — only /v1/audio/transcriptions is wired up via
  routes.post_transcriptions_oai in server.cpp:198. The /v1/audio/speech
  string appears only in the webui's TTS *client* code (it calls out
  to a separate OpenAI-compatible TTS server, not back to llama-server).

The vocoder-related flags that DO exist (--model-vocoder,
--tts-use-guide-tokens) are still documented elsewhere and unchanged.
@marksverdhei marksverdhei merged commit 05586f2 into ht Jun 12, 2026
1 check passed
@marksverdhei marksverdhei deleted the docs/server-readme-phantom-flags branch June 12, 2026 18:31
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant