Fix LayerScale ignoring init_values by Ilya-Fradlin · Pull Request #2605 · huggingface/pytorch-image-models

Ilya-Fradlin · 2025-11-04T17:42:05Z

Previously, reset_parameters() set gamma to ones, ignoring init_values and effectively disabling LayerScale’s small-init behavior.

Now, gamma is initialized to init_values via nn.init.constant_, preserving the intended effect.
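A minimal sketch of the behavior described above, assuming a simplified module rather than the actual timm source. The class name `LayerScaleSketch` and its exact constructor signature are hypothetical; only `nn.init.constant_`, `gamma`, `init_values`, and `reset_parameters()` come from the description.

```python
import torch
from torch import nn


class LayerScaleSketch(nn.Module):
    """Illustrative per-channel scaling layer; not the actual timm implementation."""

    def __init__(self, dim: int, init_values: float = 1e-5) -> None:
        super().__init__()
        self.init_values = init_values
        # Allocate uninitialized storage; reset_parameters() fills it in.
        self.gamma = nn.Parameter(torch.empty(dim))
        self.reset_parameters()

    def reset_parameters(self) -> None:
        # The problematic variant set gamma to ones, e.g. nn.init.ones_(self.gamma),
        # which discards init_values. The fix honors init_values instead.
        nn.init.constant_(self.gamma, self.init_values)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Scale each channel (last dimension) by its learned gamma.
        return x * self.gamma


layer = LayerScaleSketch(dim=4, init_values=1e-5)
out = layer(torch.ones(2, 4))
```

With this shape, calling `reset_parameters()` again (for example during a deferred or repeated init pass) restores gamma to `init_values` rather than to ones.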

rwightman · 2025-11-04T18:53:24Z

@Ilya-Fradlin thanks. FYI, I have quite a large set of changes like/related to this to implement a two-phase init, but will merge this one regardless. EDIT: whoops, I cut that off... because there's a regression here. I removed a number of local 'LayerScale' impls that did not have this issue and used this central one, which does have the problematic reset... it was probably in my changes a month back.

HuggingFaceDocBuilderDev · 2025-11-04T18:54:48Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Refactor layer_scale to use init_values for gamma

8a2fb11

rwightman merged commit ce73a2c into huggingface:main Nov 4, 2025
22 checks passed

rwightman merged 1 commit into huggingface:main from Ilya-Fradlin:patch-1
