fix(read): persist image data and inject MEDIA directive for channel delivery by QDenka · Pull Request #11754 · openclaw/openclaw

QDenka · 2026-02-08T08:11:31Z

Summary

When the read tool reads an image file, the base64 image data is returned as a content block visible to the LLM but never converted to a deliverable media URL. This means images read by agents are not sent to Telegram or other channels — the user only sees the agent's text reply without the image.

Changes

src/agents/pi-tools.read.ts: After reading an image, persist the base64 data to a cache file under .openclaw/media-cache/ in the workspace and inject a MEDIA:./relative-path directive into the text content block. The existing delivery pipeline then picks up the relative path via splitMediaFromOutput and sends the image to the channel.
src/agents/pi-tools.ts: Pass workspaceRoot to createOpenClawReadTool for the non-sandboxed path.
src/agents/pi-tools.read.image-delivery.test.ts: Tests verifying MEDIA injection for image reads, no injection for text reads, and no injection without workspaceRoot.

How it works

Agent calls read on an image file
Read tool returns { type: 'image', data: '<base64>', mimeType: 'image/png' } content block
New: The image data is persisted to .openclaw/media-cache/<hash>.png
New: A MEDIA:./… directive is appended to the text content block
The delivery pipeline (splitMediaFromOutput) extracts the media URL
Image is sent to Telegram/other channels via sendMedia

Fixes #11735

Greptile Overview

Greptile Summary

This PR updates the read tool wrapper so that when an image is read (base64 image content block), the image payload is persisted into a workspace-local cache directory (.openclaw/media-cache/) and a MEDIA: directive is appended to the tool’s text output. The existing media parsing/delivery pipeline (splitMediaFromOutput → channel sendMedia) can then detect the directive and deliver the image to downstream channels. It also wires workspaceRoot through the non-sandboxed tool creation path and adds a Vitest suite covering the injection behavior.

Confidence Score: 2/5

This PR has a few correctness issues that can break media delivery and should be fixed before merging.
The core idea (persist + MEDIA directive) fits the existing splitMediaFromOutput pipeline, but the current directive/path formatting and injection behavior are inconsistent with how MEDIA tokens are parsed/consumed, and the persistence step can produce invalid/corrupted files without detection. These are likely to cause real delivery failures or duplicated MEDIA extraction in normal operation.
src/agents/pi-tools.read.ts (MEDIA path format, injection target, image persistence validation); src/agents/pi-tools.read.image-delivery.test.ts (would currently fail once path semantics are corrected/validated)

_{(4/5) You can add custom instructions or style guidelines for the agent here!}

…delivery When the read tool reads an image file, the base64 image data is returned as a content block visible to the LLM but never converted to a deliverable media URL. This means images read by agents are not sent to Telegram or other channels. Fix: after reading an image, persist the base64 data to a cache file under .openclaw/media-cache/ in the workspace and inject a MEDIA: directive into the text content block. The delivery pipeline then picks up the relative path and sends the image to the channel. Fixes openclaw#11735

greptile-apps

_{1 file reviewed, 3 comments}

_{Edit Code Review Agent Settings | Greptile}

greptile-apps · 2026-02-08T08:13:08Z

src/agents/pi-tools.read.ts

+): Promise<string | undefined> {
+  const ext = MIME_TO_EXT[imageBlock.mimeType] ?? "png";
+  const hash = createHash("sha256")
+    .update(imageBlock.data.slice(0, 1024))


Broken MEDIA path

persistReadImage returns a path like ./.openclaw/media-cache/..., but later code (and the test) treat the MEDIA: payload as a relative path without the leading ./ (see splitMediaFromOutput which returns ./... and the test’s match(/MEDIA:\.\/(.+)/)). With the current implementation, splitMediaFromOutput will parse MEDIA:././.openclaw/... and the extracted token becomes ././.openclaw/..., which will not map cleanly back to the workspace when consumers do join(workspaceRoot, token).

This breaks delivery for the generated directives unless the receiver happens to normalize ././.

Prompt To Fix With AI

This is a comment left during a code review. Path: src/agents/pi-tools.read.ts Line: 304:307 Comment: **Broken MEDIA path** `persistReadImage` returns a path like `./.openclaw/media-cache/...`, but later code (and the test) treat the `MEDIA:` payload as a *relative path without the leading `./`* (see `splitMediaFromOutput` which returns `./...` and the test’s `match(/MEDIA:\.\/(.+)/)`). With the current implementation, `splitMediaFromOutput` will parse `MEDIA:././.openclaw/...` and the extracted token becomes `././.openclaw/...`, which will not map cleanly back to the workspace when consumers do `join(workspaceRoot, token)`. This breaks delivery for the generated directives unless the receiver happens to normalize `././`. How can I resolve this? If you propose a fix, please make it concise.

src/agents/pi-tools.read.ts

greptile-apps · 2026-02-08T08:13:10Z

src/agents/pi-tools.read.ts

+function injectMediaDirective(
+  result: AgentToolResult<unknown>,
+  mediaPath: string,
+): AgentToolResult<unknown> {
+  const content = Array.isArray(result.content) ? result.content : [];
+  const nextContent = content.map((block) => {
+    if (
+      block &&
+      typeof block === "object" &&
+      (block as { type?: unknown }).type === "text" &&
+      typeof (block as { text?: unknown }).text === "string"
+    ) {
+      const b = block as TextContentBlock & { text: string };
+      return { ...b, text: `${b.text}\nMEDIA:${mediaPath}` } satisfies TextContentBlock;
+    }


MEDIA injected into all text blocks

injectMediaDirective appends MEDIA: to every text block in the tool result. If read returns multiple text blocks (e.g., header + extra notes/errors), this will create multiple MEDIA: tokens; splitMediaFromOutput will then extract duplicates and also strip those lines from the user-visible text.

This should only inject into a single, intended text block (typically the first/header), or ensure only one directive is appended.

Prompt To Fix With AI

This is a comment left during a code review. Path: src/agents/pi-tools.read.ts Line: 327:341 Comment: **MEDIA injected into all text blocks** `injectMediaDirective` appends `MEDIA:` to *every* text block in the tool result. If `read` returns multiple text blocks (e.g., header + extra notes/errors), this will create multiple `MEDIA:` tokens; `splitMediaFromOutput` will then extract duplicates and also strip those lines from the user-visible text. This should only inject into a single, intended text block (typically the first/header), or ensure only one directive is appended. How can I resolve this? If you propose a fix, please make it concise.

openclaw-barnacle · 2026-02-21T04:23:27Z

This pull request has been automatically marked as stale due to inactivity.
Please add updates or it will be closed.

openclaw-barnacle bot added the agents Agent runtime and tooling label Feb 8, 2026

greptile-apps bot reviewed Feb 8, 2026

View reviewed changes

Merge branch 'openclaw:main' into fix/issue-11735

92d4590

christianklotz self-assigned this Feb 14, 2026

christianklotz mentioned this pull request Feb 15, 2026

fix: deliver tool result media when verbose is off #16679

Merged

2 tasks

christianklotz removed their assignment Feb 15, 2026

thewilloftheshadow force-pushed the main branch from bfc1ccb to f92900f Compare February 15, 2026 18:46

openclaw-barnacle bot added stale Marked as stale due to inactivity and removed stale Marked as stale due to inactivity labels Feb 21, 2026

QDenka closed this Feb 24, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix(read): persist image data and inject MEDIA directive for channel delivery#11754

fix(read): persist image data and inject MEDIA directive for channel delivery#11754
QDenka wants to merge 2 commits intoopenclaw:mainfrom
QDenka:fix/issue-11735

QDenka commented Feb 8, 2026 •

edited by greptile-apps bot

Loading

Uh oh!

greptile-apps bot left a comment

Uh oh!

greptile-apps bot Feb 8, 2026

Uh oh!

Uh oh!

greptile-apps bot Feb 8, 2026

Uh oh!

openclaw-barnacle bot commented Feb 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

QDenka commented Feb 8, 2026 • edited by greptile-apps bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Changes

How it works

Greptile Overview

Greptile Summary

Confidence Score: 2/5

Uh oh!

greptile-apps bot left a comment

Choose a reason for hiding this comment

Uh oh!

greptile-apps bot Feb 8, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

greptile-apps bot Feb 8, 2026

Choose a reason for hiding this comment

Uh oh!

openclaw-barnacle bot commented Feb 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

QDenka commented Feb 8, 2026 •

edited by greptile-apps bot

Loading