Skip to content

Feat/web chat multimodal#178

Merged
primoco merged 2 commits into
mainfrom
feat/web-chat-multimodal
Jun 7, 2026
Merged

Feat/web chat multimodal#178
primoco merged 2 commits into
mainfrom
feat/web-chat-multimodal

Conversation

@primoco

@primoco primoco commented Jun 7, 2026

Copy link
Copy Markdown
Collaborator

No description provided.

primoco added 2 commits June 7, 2026 11:03
Axum's DefaultBodyLimit is 2 MB. A base64-encoded image or audio clip on
/api/chat easily exceeds that (base64 inflates ~33%), so anything beyond a
small thumbnail returned `413: Failed to buffer the request body: length
limit exceeded`. The Windows test slipped through only because the sample
image was ~175 KB; a normal photo trips it immediately.

Apply DefaultBodyLimit::max(64 MB) to both the API and UI routers — large
enough for images and reasonable audio clips, still bounded against abuse.
Promote the multimodal MVP to the 0.6.0 release and document it.

* engine/Cargo.toml: 0.6.0-beta.8 → 0.6.0.
* README:
  - "What's ready today" table: Engine → v0.6.0; new Multimodal row
    (vision validated on Linux + Windows CUDA, audio experimental/CLI);
    Chat UI gains image attachment.
  - Engine key features: add the multimodal bullet.
  - New "Multimodal: vision + audio" subsection with CLI + Chat UI usage,
    plus an honest scope note (vision solid; audio experimental upstream;
    multimodal loads sequential; web UI = images only for now; Q4 quality
    bounds; Linux libnccl.so.2 dependency; vendored llama-cpp-rs pre-release).
  - Roadmap Phase 1: mark multimodal shipped; add planned embeddings
    endpoint for Ollama/OpenAI API parity.

Honest status: vision is runtime-validated on Linux + Windows CUDA (image
OCR + scene description, Chat UI and CLI). Audio understanding works via the
CLI but is experimental upstream and is not yet wired into the web UI.
@primoco primoco merged commit 9ad521c into main Jun 7, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant