feat(media): add parakeet-mlx CLI output support #9177

Merged
steipete merged 1 commit into openclaw:main from mac-110:fix/parakeet-mlx-output
Mar 2, 2026

mac-110 commented Feb 4, 2026

Summary

Add native support for reading parakeet-mlx output files in resolveCliOutput.

Problem

Parakeet-mlx is a fast, local speech-to-text model (based on NVIDIA Parakeet) that runs on Apple Silicon via MLX. It writes transcripts to --output-dir/filename.txt, but OpenClaw's resolveCliOutput only supported whisper, whisper-cli, gemini, and sherpa-onnx-offline.

Without this fix, users need a wrapper script that outputs to stdout instead.

Solution

Add a resolveParakeetOutputPath() helper function that:

  • Looks for the --output-dir argument (note: hyphenated, unlike whisper's --output_dir)
  • Constructs the output path as outputDir/mediaBasename.txt
  • Returns null if --output-format is not txt
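The three steps above can be sketched roughly as follows. This is a minimal illustration, not the code merged in this PR: the findArgValue helper and the function signature are hypothetical, mediaBasename is assumed to mean the media file name without its extension, and a missing --output-format is assumed to return null as well.

```typescript
import * as path from "node:path";

// Hypothetical helper: returns the value that follows `flag` in the argument list.
function findArgValue(args: readonly string[], flag: string): string | undefined {
  const index = args.indexOf(flag);
  return index >= 0 && index + 1 < args.length ? args[index + 1] : undefined;
}

// Sketch of the path resolution described in the PR; the real signature may differ.
function resolveParakeetOutputPath(
  args: readonly string[],
  mediaPath: string,
): string | null {
  // parakeet-mlx spells the flag with a hyphen (whisper uses --output_dir).
  const outputDir = findArgValue(args, "--output-dir");
  if (!outputDir) return null;

  // Assumption: only an explicit "txt" format yields a transcript file to read.
  const outputFormat = findArgValue(args, "--output-format");
  if (outputFormat !== "txt") return null;

  // Assumption: mediaBasename is the file name with its extension stripped.
  const mediaBasename = path.parse(mediaPath).name;
  return path.join(outputDir, `${mediaBasename}.txt`);
}
```

With the arguments from the config example below, a media path of /media/clip.wav and an output directory of /tmp/out would resolve to /tmp/out/clip.txt under these assumptions.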

Config Example

{
  "tools": {
    "media": {
      "audio": {
        "models": [{
          "type": "cli",
          "command": "parakeet-mlx",
          "args": ["{{MediaPath}}", "--output-format", "txt", "--output-dir", "{{OutputDir}}"]
        }]
      }
    }
  }
}

Note

Previous issue #7552 was incorrectly auto-closed as a duplicate of #7536 (Windows path bug), which is unrelated. This PR addresses the original feature request.

Greptile Overview

Greptile Summary

This PR extends resolveCliOutput in src/media-understanding/runner.ts to support parakeet-mlx by resolving the transcript file path from CLI arguments (using --output-dir/--output-format) and reading {{OutputDir}}/{{mediaBasename}}.txt when present.

The change fits the existing CLI-provider pattern already used for whisper/whisper-cli, where transcript output may be written to a temp output directory and then ingested back into the media understanding pipeline.
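The "resolve the output path, then read the file if present" pattern mentioned above might look like the sketch below. The function name readTranscriptIfPresent is hypothetical, and falling back to the command's stdout when no file is found is an assumption about the pipeline, not something the PR text confirms.

```typescript
import * as fs from "node:fs/promises";

// Hypothetical sketch: prefer the transcript file written by the CLI,
// otherwise fall back to whatever the command printed to stdout (assumed).
async function readTranscriptIfPresent(
  outputPath: string | null,
  stdout: string,
): Promise<string> {
  if (outputPath) {
    try {
      const text = (await fs.readFile(outputPath, "utf8")).trim();
      if (text) return text;
    } catch {
      // File missing or unreadable: fall through to stdout.
    }
  }
  return stdout.trim();
}
```

Returning null from the path resolver keeps this step simple: a null path skips the file read entirely, so unsupported output formats never cause a lookup for a .txt file that was not written.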

Confidence Score: 5/5

  • This PR is safe to merge with minimal risk.
  • The change is small, localized to CLI output resolution, and follows the existing pattern used for other CLI transcribers (resolve output path then read file if present). No definite functional regressions were identified from the diff.
  • No files require special attention

@openclaw-barnacle

This pull request has been automatically marked as stale due to inactivity.
Please add updates or it will be closed.

openclaw-barnacle bot added and then removed the stale label Feb 21, 2026
steipete force-pushed the fix/parakeet-mlx-output branch from 1a04ecc to ff21091 on March 2, 2026 22:22
steipete merged commit d89c25d into openclaw:main Mar 2, 2026

steipete commented Mar 2, 2026

Landed via temp rebase onto main.

  • Gate: pnpm test src/media-understanding/apply.test.ts (pass); pnpm check (blocked by unrelated existing type errors in extensions/zalouser and src/gateway/server-reload-handlers.ts)
  • Land commit: ff21091
  • Merge commit: d89c25d

Thanks @mac-110!

openclaw-barnacle bot added the docs and size: S labels Mar 2, 2026
dawi369 pushed a commit to dawi369/davis that referenced this pull request Mar 3, 2026
OWALabuy pushed a commit to kcinzgg/openclaw that referenced this pull request Mar 4, 2026
zooqueen pushed a commit to hanzoai/bot that referenced this pull request Mar 6, 2026