fix: forward use_cache kwarg to attention mixer in nemotron_h by CharlieKerfoot · Pull Request #45792 · huggingface/transformers

CharlieKerfoot · 2026-05-05T17:24:06Z

In src/transformers/models/nemotron_h/modular_nemotron_h.py:294 the attention mixer is called with user_cache=use_cache. The typo means use_cache is never forwarded and an unexpected user_cache kwarg gets passed through instead.

Simply, Rename the keyword argument from user_cache to use_cache so the flag actually reaches the attention mixer.

github-actions · 2026-05-05T17:25:21Z

[For maintainers] Suggested jobs to run (before merge)

run-slow: nemotron_h

zucchini-nlp

Nice catch! TBH I am sure if we are keeping use_cache arg on purpose since it is not used anymore in latest releases. Mostly just passed as kwarg

Could you run make fix-repo to fxi CI?

HuggingFaceDocBuilderDev · 2026-05-05T18:22:02Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

…gface#45792) * fix: forward use_cache kwarg to attention mixer in nemotron_h * Ran make fix-repo

fix: forward use_cache kwarg to attention mixer in nemotron_h

a03e2ed

zucchini-nlp approved these changes May 5, 2026

View reviewed changes

CharlieKerfoot force-pushed the fix/attention-mixer-invoked branch from 77148a6 to a03e2ed Compare May 5, 2026 17:40

Ran make fix-repo

5958f8e

zucchini-nlp enabled auto-merge May 5, 2026 18:17

zucchini-nlp added this pull request to the merge queue May 5, 2026

Merged via the queue into huggingface:main with commit 41c3a5a May 5, 2026
22 checks passed

Exile333 pushed a commit to Exile333/transformers that referenced this pull request May 6, 2026

fix: forward use_cache kwarg to attention mixer in nemotron_h (huggin…

6b1b392

…gface#45792) * fix: forward use_cache kwarg to attention mixer in nemotron_h * Ran make fix-repo

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: forward use_cache kwarg to attention mixer in nemotron_h#45792

fix: forward use_cache kwarg to attention mixer in nemotron_h#45792
zucchini-nlp merged 2 commits into
huggingface:mainfrom
CharlieKerfoot:fix/attention-mixer-invoked

CharlieKerfoot commented May 5, 2026

Uh oh!

github-actions Bot commented May 5, 2026

Uh oh!

zucchini-nlp left a comment •

edited

Loading

Uh oh!

HuggingFaceDocBuilderDev commented May 5, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

CharlieKerfoot commented May 5, 2026

Uh oh!

github-actions Bot commented May 5, 2026

Uh oh!

zucchini-nlp left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

HuggingFaceDocBuilderDev commented May 5, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

zucchini-nlp left a comment •

edited

Loading