docs(server): drop phantom --talker-model + --code2wav-model rows by marksverdhei · Pull Request #85 · heiervang-technologies/ht-llama.cpp

marksverdhei · 2026-06-05T20:15:52Z

Summary

Epoch #81 task 5 (docs review on `tools/server/README.md`). The server-specific params table at lines 178-179 documented two flags that don't exist on this fork:

Phantom flag	Help text claims	Reality
`--talker-model FILE`	"enables the /v1/audio/speech endpoint"	No `talker` in `common/arg.cpp` or any C++ source; `/v1/audio/speech` is not a registered route (only `/v1/audio/transcriptions` exists via `routes.post_transcriptions_oai` in `server.cpp:198`)
`--code2wav-model FILE`	"qwen3-omni code2wav gguf"	No `code2wav` in arg.cpp or C++ source; only appears in `conversion/qwen3.py` as a tensor-name prefix in the GGUF converter, which is unrelated to runtime flags

The webui does reference `/v1/audio/speech` — but only in its TTS client code (calling out to an external OpenAI-compatible TTS server), not as something llama-server itself serves.

The real vocoder-related flags (`--model-vocoder`, `--tts-use-guide-tokens`) are still documented elsewhere and unchanged.

Test plan

Confirmed via grep that neither flag string exists in arg.cpp or any `.cpp`/`.h` outside conversion scripts.
Confirmed only `/v1/audio/transcriptions` is registered (no `/v1/audio/speech`).
Other 22 documented endpoints all match actual routes (audit pass).
(post-merge) markdown table renders cleanly on GitHub.

Audit scope

Spot-checked 6 documented defaults (`--ctx-checkpoints`, `--models-max`, `--cache-ram`, `--slot-prompt-similarity`, `--reasoning`, `--models-autoload`) — all match `common/common.h` values. So no broader sweep needed.

🤖 Generated with Claude Code

Epoch #81 task 5 (docs review). Both rows in the server-specific params table reference qwen3-omni TTS support that does not exist: * Neither --talker-model nor --code2wav-model is registered in common/arg.cpp; no C++ source mentions the strings "talker" or "code2wav" (only model-conversion code in conversion/qwen3.py references them as tensor name prefixes, which is unrelated). * The /v1/audio/speech endpoint the help text promises is also absent — only /v1/audio/transcriptions is wired up via routes.post_transcriptions_oai in server.cpp:198. The /v1/audio/speech string appears only in the webui's TTS *client* code (it calls out to a separate OpenAI-compatible TTS server, not back to llama-server). The vocoder-related flags that DO exist (--model-vocoder, --tts-use-guide-tokens) are still documented elsewhere and unchanged.

This was referenced Jun 5, 2026

Hivemind Maintenance Tasks Epoch 3 #81

Closed

Hivemind Maintenance Tasks Epoch 4 #86

Closed

marksverdhei merged commit 05586f2 into ht Jun 12, 2026
1 check passed

marksverdhei deleted the docs/server-readme-phantom-flags branch June 12, 2026 18:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs(server): drop phantom --talker-model + --code2wav-model rows#85

docs(server): drop phantom --talker-model + --code2wav-model rows#85
marksverdhei merged 1 commit into
htfrom
docs/server-readme-phantom-flags

marksverdhei commented Jun 5, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

marksverdhei commented Jun 5, 2026

Summary

Test plan

Audit scope

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant