-
Notifications
You must be signed in to change notification settings - Fork 1.4k
Description
Summary
During session commit, OpenViking recursively enqueues the session temp directory into SemanticQueue, which causes messages.jsonl (the archived session transcript) to be summarized via the generic file-summary prompt and sent to the configured VLM (ov-llm).
From a production usage perspective, this looks wasteful and low-value:
- it re-sends large chunks of already-known chat history to the model,
- increases token cost and latency,
- adds load to
ov-llm, - and does not seem to provide meaningful additional retrieval value for
messages.jsonlspecifically.
What I observed
The current flow appears to be:
session.commit()archives current messages.- It enqueues
session / user / agenttemp trees intoSemanticQueuewithrecursive=True. SemanticProcessorwalks files and summarizes them.messages.jsonlis treated as a generic text file and goes throughsemantic.file_summary.- The resulting prompt is literally:
Please generate a summary for the following file:
【File Name】
messages.jsonl
【File Content】
...
At runtime this produced repeated ov-llm requests containing historical session transcript content from messages.jsonl.
Why this seems problematic
messages.jsonl is not a normal user document. It is already the canonical session transcript / archive. Summarizing it again as a generic file seems to duplicate work already covered by:
- session structured summary / compression,
- memory extraction,
- and directory-level semantic generation.
So the current behavior feels like a side effect of the generic recursive semantic pipeline, rather than an intentional high-value feature for session archives.
Question
Is this behavior intentional?
If yes, what retrieval / indexing benefit is expected from generating a generic file summary for messages.jsonl after every commit?
Suggestion
Possible options:
- skip
messages.jsonlin semantic file-summary generation, - or special-case session archive files so they do not go through generic file summarization,
- or make this behavior configurable.
I think this would reduce unnecessary VLM traffic and make session commit cheaper / faster in production.
Metadata
Metadata
Assignees
Labels
Type
Projects
Status