Releases: mudler/parakeet.cpp
Releases · mudler/parakeet.cpp
v0.2.0
What's Changed
- feat: banded local (Longformer) attention — fix O(T^2) long-audio OOM by @localai-bot in #9
- Add nemotron-3.5-asr-streaming-0.6b (multilingual prompt-conditioned streaming) by @localai-bot in #10
- Batch mode for nemotron: batched causal subsampling + batched target_lang C-API by @localai-bot in #11
- docs: supported-models table in the README by @localai-bot in #12
- docs(bench): nemotron GPU numbers on the GB10 by @localai-bot in #14
- fix: reset the streaming decoder on / so transcription continues (#13) by @localai-bot in #15
- feat(capi): segment-timestamp support (frame_sec + streaming JSON, ABI v4) by @localai-bot in #16
- fix: define M_PI for MSVC builds by @mvanhorn in #18
- fix: tile subsampling for long audio to avoid ggml 2^31 tensor overflow on GPU by @localai-bot in #19
- fix: select integrated GPUs and allow PARAKEET_DEVICE to name a device by @localai-bot in #20
- ci: pre-built release binaries for linux, macos and windows by @localai-bot in #22
New Contributors
- @localai-bot made their first contribution in #9
- @mvanhorn made their first contribution in #18
Full Changelog: v0.1.2...v0.2.0
v0.1.2
v0.1.1
Full Changelog: https://github.com/mudler/parakeet.cpp/commits/v0.1.1
v0.1.0: parakeet.cpp: C++/ggml port of NVIDIA NeMo Parakeet ASR
Self-contained snapshot. FastConformer TDT / CTC / RNNT / hybrid models with a log-mel front-end, CPU and GPU (CUDA / HIP / Vulkan / Metal) ggml graphs, quantization (f16, q8_0, q6_k, q5_k, q4_k), a CLI, and a flat C-API (include/parakeet_capi.h) consumed by the LocalAI parakeet-cpp backend. Includes the NeMo parity suite and HF publishing tooling (scripts/publish_hf.py -> mudler/parakeet-cpp-gguf). Assisted-by: Claude:claude-opus-4-8 [Claude Code]