feat: Use NVIDIA NIM ASR for audio transcription by MauroDruwel · Pull Request #53 · Alishahryar1/free-claude-code

MauroDruwel · 2026-02-24T18:30:36Z

Summary

Added NVIDIA NIM as a second transcription option (alongside local Whisper). This lets you transcribe voice notes using NVIDIA's cloud API instead of running Whisper locally.

What changed

Transcription: Now supports two backends
- Local Whisper: Free, runs on your GPU/CPU (existing)
- NVIDIA NIM: Cloud API via Riva gRPC (new)
Supported models: 8 NVIDIA NIM models added (Parakeet variants for different languages, Whisper Large V3)
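
A minimal sketch of how a setting could select between the two backends listed above. The names `BACKENDS`, `get_transcriber`, and the backend keys are hypothetical and are not taken from the PR's code; the two backend functions are placeholders rather than real Whisper or Riva calls.

```python
from typing import Callable, Dict

# A transcriber takes raw audio bytes and returns the recognized text.
Transcriber = Callable[[bytes], str]


def transcribe_local_whisper(audio: bytes) -> str:
    # Placeholder: a real backend would run a Whisper model on the local GPU/CPU.
    raise NotImplementedError("local Whisper is not wired up in this sketch")


def transcribe_nvidia_nim(audio: bytes) -> str:
    # Placeholder: a real backend would send the audio to NVIDIA NIM over Riva gRPC.
    raise NotImplementedError("NVIDIA NIM is not wired up in this sketch")


# Hypothetical backend keys; the PR's actual setting values may differ.
BACKENDS: Dict[str, Transcriber] = {
    "local_whisper": transcribe_local_whisper,
    "nvidia_nim": transcribe_nvidia_nim,
}


def get_transcriber(name: str) -> Transcriber:
    """Look up a backend by name, ignoring case and surrounding whitespace."""
    key = name.strip().lower()
    if key not in BACKENDS:
        raise ValueError(
            f"unknown transcription backend {name!r}; expected one of {sorted(BACKENDS)}"
        )
    return BACKENDS[key]
```

Keeping the lookup in one place means a caller only needs the configured name, and an unsupported value fails early with a message that lists the valid options.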

Alishahryar1 · 2026-02-25T06:49:54Z

I decided to not add this because it adds an additional dependency and server outside of uv which is a hassle for users.

MauroDruwel · 2026-02-25T09:13:00Z

> I decided to not add this because it adds an additional dependency and server outside of uv which is a hassle for users.

This is an optional dependency, not a new required one. I still need to update the .toml so that it lives in its own group, something like voice_nim, instead of voice.
It actually has fewer dependencies than local Whisper (no PyTorch/CUDA/model downloads). And since the project already depends on NVIDIA NIM and requires an API key and NVIDIA servers, this does not introduce a new server or hassle for the users. Let me know why you think it might be a hassle please :)
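
One way to keep the NIM backend optional at runtime is to check for its client package before using it. The sketch below is an assumption about how that could look, not the PR's implementation: `require_module` is a hypothetical helper, `riva` is the import package shipped by nvidia-riva-client, and the `uv sync --extra voice_nim` hint assumes the extra ends up with the name proposed in this thread.

```python
import importlib.util


def module_available(module_name: str) -> bool:
    """Return True if a top-level module can be found, without importing it."""
    return importlib.util.find_spec(module_name) is not None


def require_module(module_name: str, extra: str) -> None:
    """Raise a readable error if an optional dependency is missing.

    `extra` is the name of the optional dependency group to suggest;
    "voice_nim" is the name proposed in the thread, used here as an assumption.
    """
    if not module_available(module_name):
        raise RuntimeError(
            f"optional module {module_name!r} is not installed; "
            f"try: uv sync --extra {extra}"
        )


def check_nim_backend() -> None:
    # Only needed when the NVIDIA NIM backend is selected; local Whisper
    # users never hit this check and never need the Riva client.
    require_module("riva", "voice_nim")
```

Calling `check_nim_backend()` only on the NIM code path keeps the heavy local Whisper dependencies and the Riva client independent of each other.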

Alishahryar1 · 2026-02-25T12:51:23Z

Ok sounds good as long as it can be with uv sync --voice and doesn't require launching an extra server. You can do it.

MauroDruwel · 2026-02-25T13:25:39Z

> Ok sounds good as long as it can be with uv sync --voice and doesn't require launching an extra server. You can do it.

Isn't it better to use something like uv sync --voice_nim? Some dependencies in --voice are over 1 GB and aren't needed for my part. They're only required for local Whisper, so downloading them would be pointless.

Alishahryar1 · 2026-02-25T13:26:57Z

Yes, that's better

MauroDruwel · 2026-02-26T20:59:51Z

Something to note: the project uses a very recent Python version (3.14), and wheels for grpcio-tools are not yet available. As a result, on a fresh system you need to install the required build dependencies:
sudo apt install build-essential cmake pkg-config

Fixed in b3d815c by using newer versions of grpcio than nvidia-riva-client initially offered.

Alishahryar1 · 2026-02-27T06:45:42Z

Is it ready to be merged?

MauroDruwel · 2026-02-27T09:12:25Z

> Is it ready to be merged?

Not yet, I still need to fix something in transcription. I will mark it as ready when I'm ready :)

MauroDruwel · 2026-02-28T13:47:01Z

@Alishahryar1 It's ready for review 😉
Feel free to test it and play around with it first. I haven't done much testing, but it seems to work and is quite fast as well.

Alishahryar1 · 2026-02-28T14:22:17Z

Some CI checks are failing. You can run those locally as well; it's best practice to run all checks locally before pushing.

… throw a name error

Alishahryar1 · 2026-02-28T16:48:25Z

LGTM. Great work!

Co-authored-by: Alishahryar1 <alishahryar2@gmail.com>

MauroDruwel added 2 commits February 24, 2026 07:00

Add nvidia_nim whisper

8b2c4ac

Update to riva support

1a15c99

Alishahryar1 closed this Feb 25, 2026

Alishahryar1 reopened this Feb 25, 2026

MauroDruwel added 4 commits February 26, 2026 19:28

Default voice packages -> NVIDIA_NIM

637bf31

Remove duplicate

da0bd1a

Same sequence as .env.example

599eeec

set default

37a2756

Use newer versions of grpcio

b3d815c

Alishahryar1 reviewed Feb 27, 2026

Comment thread .env.example

MauroDruwel and others added 5 commits February 28, 2026 13:09

Clean up

fadeaee

Clean up unnecessary stuff

addedcb

Fix errors

82ae04e

Last final touches

ef874ee

Merge branch 'main' into useNvidiaNimASR

269eb02

MauroDruwel marked this pull request as ready for review February 28, 2026 13:39

MauroDruwel changed the title ~~Use nvidia nim asr~~ feat: Use NVIDIA NIM ASR for audio transcription Feb 28, 2026

uv ruff format

63bd865

MauroDruwel added 2 commits February 28, 2026 15:22

uv ruff heck fix

84ff4a7

Fix unresolved things

ca9d0fc