feat(audio): auto-echo transcription to chat before agent processing by AytuncYildizli · Pull Request #32150 · openclaw/openclaw

AytuncYildizli · 2026-03-02T20:32:28Z

Summary

When a user sends a voice message, the transcription is now echoed back to the chat before the agent processes it — so users can immediately verify what was heard.

What changed

Config (`tools.media.audio`)

Two new optional fields on MediaUnderstandingConfig:

{
  "tools": {
    "media": {
      "audio": {
        "echoTranscript": true,          // default: false
        "echoFormat": "🎙️ Heard: {transcript}"  // default: '📝 "{transcript}"'
      }
    }
  }
}
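
To make the defaults concrete, here is a minimal sketch of how the two fields could be resolved and applied. The names `AudioEchoConfig`, `resolveEchoSettings`, and `renderEcho` are hypothetical and do not come from the PR; only the field names and default values are taken from the description above. Replacing every `{transcript}` occurrence is also an assumption.

```typescript
// Hypothetical shape: the PR adds these two optional fields to MediaUnderstandingConfig.
interface AudioEchoConfig {
  echoTranscript?: boolean; // documented default: false
  echoFormat?: string; // documented default: '📝 "{transcript}"'
}

const DEFAULT_ECHO_FORMAT = '📝 "{transcript}"';

// Assumed helper: fills in the documented defaults when fields are absent.
function resolveEchoSettings(
  cfg: AudioEchoConfig | undefined,
): { enabled: boolean; format: string } {
  return {
    enabled: cfg?.echoTranscript === true,
    format: cfg?.echoFormat ?? DEFAULT_ECHO_FORMAT,
  };
}

// Assumed helper: substitutes every {transcript} placeholder.
// split/join sidesteps the special "$" patterns of String.prototype.replace.
function renderEcho(format: string, transcript: string): string {
  return format.split("{transcript}").join(transcript);
}
```

With no config at all, `resolveEchoSettings(undefined)` reports the echo as disabled, which matches the opt-in behavior described in this PR.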

Echo logic (`src/media-understanding/apply.ts`)

After transcription succeeds and ctx.Transcript is set, the following runs when echoTranscript is true:

Resolves channel and recipient from ctx.Provider/ctx.Surface and ctx.OriginatingTo/ctx.From
Guards on isDeliverableMessageChannel — silently skips for internal/non-deliverable channels
Calls deliverOutboundPayloads with bestEffort: true — delivery failure never blocks transcription
Formats the message via the configurable echoFormat template ({transcript} placeholder)
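
The steps above can be sketched roughly as follows. This is not the code in `apply.ts`: the function name, the `EchoDeps` injection, and the payload shape are illustrative stand-ins, and the precedence of `Provider` over `Surface` is an assumption. Preferring `OriginatingTo` over `From` follows the order given in the list and in the test table.

```typescript
// All names here are illustrative stand-ins, not the project's real signatures.
interface EchoContext {
  Transcript?: string;
  Provider?: string;
  Surface?: string;
  OriginatingTo?: string;
  From?: string;
}

interface EchoDeps {
  // Stand-in for isDeliverableMessageChannel.
  isDeliverable: (channel: string) => boolean;
  // Stand-in for deliverOutboundPayloads; the payload shape is assumed.
  deliver: (msg: { channel: string; to: string; text: string }) => Promise<void>;
}

// Returns true only when an echo was actually delivered.
async function echoTranscriptSketch(
  ctx: EchoContext,
  format: string,
  deps: EchoDeps,
): Promise<boolean> {
  const transcript = ctx.Transcript;
  if (!transcript) return false; // nothing to echo

  const channel = ctx.Provider ?? ctx.Surface; // precedence assumed
  const to = ctx.OriginatingTo ?? ctx.From;
  if (!channel || !to) return false; // missing channel or recipient: skip silently
  if (!deps.isDeliverable(channel)) return false; // internal channel: skip silently

  const text = format.split("{transcript}").join(transcript);
  try {
    await deps.deliver({ channel, to, text });
    return true;
  } catch {
    // Best effort: a delivery failure must never surface to the caller.
    return false;
  }
}
```

Passing the channel check and the delivery function in as parameters keeps the sketch self-contained and makes each guard easy to exercise in isolation.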

Schema docs

src/config/schema.help.ts — help text for both new fields
src/config/schema.labels.ts — UI labels for both new fields

Tests (`src/media-understanding/apply.echo-transcript.test.ts`)

10 test cases:

| Case | Expected |
| --- | --- |
| `echoTranscript: false` (explicit) | no echo |
| `echoTranscript` absent (default) | no echo |
| `echoTranscript: true` | echo sent with default format |
| custom `echoFormat` | echo uses custom template |
| no audio attachment | no echo |
| transcription fails | no echo |
| channel not deliverable | no echo |
| ctx has no `From`/`OriginatingTo` | no echo |
| `OriginatingTo` present, `From` absent | echo uses `OriginatingTo` |
| echo delivery throws | transcription still succeeds, no throw |

Checklist

Closes #32102

When echoTranscript is enabled in tools.media.audio config, the transcription text is sent back to the originating chat immediately after successful audio transcription, before the agent processes it. This lets users verify what was heard from their voice note.

Changes:
- config/types.tools.ts: add echoTranscript (bool) and echoFormat (string template) to MediaUnderstandingConfig
- media-understanding/apply.ts: sendTranscriptEcho() helper that resolves channel/to from ctx, guards on isDeliverableMessageChannel, and calls deliverOutboundPayloads best-effort
- config/schema.help.ts: help text for both new fields
- config/schema.labels.ts: labels for both new fields
- media-understanding/apply.echo-transcript.test.ts: 10 vitest cases covering disabled/enabled/custom-format/no-audio/failed-transcription/non-deliverable-channel/missing-from/OriginatingTo/delivery-failure

Default echoFormat: '📝 "{transcript}"'

Closes openclaw#32102

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: ad765098d9

chatgpt-codex-connector · 2026-03-02T20:38:10Z

src/config/types.tools.ts

+  echoTranscript?: boolean;
+  /**
+   * Format string for the echoed transcript. Use `{transcript}` as placeholder.
+   * Default: '📝 "{transcript}"'
+   */


Add audio echo options to validated config schema

The new tools.media.audio.echoTranscript / echoFormat settings are declared and documented here, but they were not added to the runtime Zod schema (src/config/zod-schema.core.ts, ToolsMediaUnderstandingSchema at lines 674-685), which is .strict(). As a result, any config that follows this commit’s docs and sets these keys will fail validation as unrecognized fields, so the feature cannot be enabled through normal config loading.
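
To illustrate the failure mode this review describes: a Zod object schema marked `.strict()` treats any key it does not declare as a validation error. The sketch below is a dependency-free stand-in for that behavior, not the project's `ToolsMediaUnderstandingSchema`. The helper name and the `enabled` key are hypothetical; only `echoTranscript` and `echoFormat` come from the PR.

```typescript
// Dependency-free stand-in for a strict object schema: reports keys
// that the schema does not declare.
function findUnrecognizedKeys(
  input: Record<string, unknown>,
  allowed: readonly string[],
): string[] {
  return Object.keys(input).filter((key) => !allowed.includes(key));
}

// A config that follows this PR's docs ("enabled" is a hypothetical existing key).
const audioConfig = {
  enabled: true,
  echoTranscript: true,
  echoFormat: "🎙️ Heard: {transcript}",
};

// Hypothetical key lists before and after the schema is updated.
const allowedBeforeFix = ["enabled"];
const allowedAfterFix = [...allowedBeforeFix, "echoTranscript", "echoFormat"];

// Before the fix, both new keys are flagged and the config would be rejected.
const rejectedKeys = findUnrecognizedKeys(audioConfig, allowedBeforeFix);
// After the fix, nothing is flagged.
const remainingKeys = findUnrecognizedKeys(audioConfig, allowedAfterFix);
```

Declaring the fields in the TypeScript type alone is therefore not enough: the runtime schema has to list them as well.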

greptile-apps · 2026-03-02T20:38:48Z

Greptile Summary

This PR adds a new optional feature to echo audio transcriptions back to the chat before agent processing, allowing users to verify what was heard from their voice messages.

Key changes:

Added two new optional config fields: echoTranscript (default: false) and echoFormat (default: '📝 "{transcript}"')
Implemented echo logic in src/media-understanding/apply.ts that sends the transcript via deliverOutboundPayloads with bestEffort: true
Echo is only sent when transcription succeeds, the channel is deliverable, and recipient info is available
Comprehensive test coverage with 10 test cases covering all edge cases including delivery failures, missing context, and non-deliverable channels

Implementation quality:

Follows existing codebase patterns and style
Properly defensive with guards for missing channel/recipient and non-deliverable channels
Best-effort delivery ensures echo failures never block transcription
Dynamic import of deliverOutboundPayloads avoids potential circular dependencies
Config schema documentation is consistent and complete

The implementation is clean, well-tested, and handles edge cases appropriately. No issues found.

Confidence Score: 5/5

This PR is safe to merge with no identified risks
Score reflects excellent code quality with comprehensive test coverage (10 test cases), proper error handling, defensive programming patterns, and minimal scope of changes. The feature is opt-in by default, uses best-effort delivery to avoid breaking existing functionality, and all edge cases are properly handled and tested.
No files require special attention

Last reviewed commit: ad76509

steipete · 2026-03-02T21:47:13Z

Landed via temp rebase onto main.

Gate: pnpm vitest src/media-understanding/apply.echo-transcript.test.ts src/media-understanding/apply.test.ts
Land commit: 327de97da1a2ea68dd60e90b4bc7f8cec8e24214
Merge commit: 8f995df

Thanks @AytuncYildizli!

openclaw-barnacle bot added the size: M label Mar 2, 2026

chatgpt-codex-connector bot reviewed Mar 2, 2026

fix(audio): add echoTranscript/echoFormat to Zod config schema

290f8d5

steipete merged commit 8f995df into openclaw:main Mar 2, 2026
24 of 26 checks passed

steipete added a commit that referenced this pull request Mar 2, 2026

fix: keep audio transcript echo off-by-default and tiny-audio-safe (#…

8e259b8

…32150)

github-actions bot mentioned this pull request Mar 2, 2026

📡 Upstream Digest — 2026-03-02 22:20 UTC curtismercier/openclaw-mods#167

Open

dawi369 pushed a commit to dawi369/davis that referenced this pull request Mar 3, 2026

fix: keep audio transcript echo off-by-default and tiny-audio-safe (o…

cd73d61

…penclaw#32150)

OWALabuy pushed a commit to kcinzgg/openclaw that referenced this pull request Mar 4, 2026

fix: keep audio transcript echo off-by-default and tiny-audio-safe (o…

b234497

…penclaw#32150)

zooqueen pushed a commit to hanzoai/bot that referenced this pull request Mar 6, 2026

fix: keep audio transcript echo off-by-default and tiny-audio-safe (o…

e2cc40e

…penclaw#32150)

steipete merged 2 commits into openclaw:main from AytuncYildizli:feat/audio-echo-transcript
