mtmd: mtmd_audio_streaming_istft for audio output by tdakhran · Pull Request #18645 · ggml-org/llama.cpp

tdakhran · 2026-01-06T16:21:34Z

Change is decoupled from #18641.

LFM2.5-Audio-1.5B needs streaming istft for generating output audio.

add streaming ISTFT class (mtmd_audio_streaming_istft) with overlap-add for audio reconstruction
replace global audio cache with per-instance cache, the model requires two independent caches, for preprocessing (audio input) and for istft (audio output).
unified templated FFT/IFFT implementation supporting both forward and inverse transforms

Make sure to read the contributing guidelines before submitting a PR

Change is decoupled from ggml-org#18641. [LFM2.5-Audio-1.5B](https://huggingface.co/LiquidAI/LFM2.5-Audio-1.5B) needs streaming istft for generating output audio. * add streaming ISTFT class (`mtmd_audio_streaming_istft`) with overlap-add for audio reconstruction * replace global audio cache with per-instance cache, the model requires two independent caches, for preprocessing (audio input) and for istft (audio output). * unified templated FFT/IFFT implementation supporting both forward and inverse transforms

ngxson

nice, thanks! I ran the test and confirmed that this doesn't break other models

Change is decoupled from ggml-org#18641. [LFM2.5-Audio-1.5B](https://huggingface.co/LiquidAI/LFM2.5-Audio-1.5B) needs streaming istft for generating output audio. * add streaming ISTFT class (`mtmd_audio_streaming_istft`) with overlap-add for audio reconstruction * replace global audio cache with per-instance cache, the model requires two independent caches, for preprocessing (audio input) and for istft (audio output). * unified templated FFT/IFFT implementation supporting both forward and inverse transforms

tdakhran requested a review from ngxson as a code owner January 6, 2026 16:21

tdakhran mentioned this pull request Jan 6, 2026

[Do Not Merge] model : LFM2.5-Audio-1.5B #18641

Draft

6 tasks

ngxson approved these changes Jan 6, 2026

View reviewed changes

github-actions Bot added the examples label Jan 6, 2026

ngxson merged commit ccbc84a into ggml-org:master Jan 6, 2026
71 of 72 checks passed

elfarolab mentioned this pull request Jan 7, 2026

scripts : add pr2wt.sh #18644

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

mtmd: mtmd_audio_streaming_istft for audio output#18645

mtmd: mtmd_audio_streaming_istft for audio output#18645
ngxson merged 1 commit into
ggml-org:masterfrom
tdakhran:tarek/dev/istft-upstream

tdakhran commented Jan 6, 2026

Uh oh!

ngxson left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

tdakhran commented Jan 6, 2026

Uh oh!

ngxson left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants