Fixed incorrect normalization by audioXD · Pull Request #40436 · huggingface/transformers

audioXD · 2025-08-25T17:16:47Z

I noticed a possible typo in src/transformers/image_processing_utils_fast.py#compile_friendly_resize: for uint8 images, the normalization uses 256 instead of 255. This is slightly off, although it still works because it is done consistently (the image is normalized and denormalized the same way).

image = image.float() / 256
image = image * 256

The explanation is simple:

255 (the max value for a uint8) should map to 1, but it doesn't with the current implementation.
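To make the arithmetic concrete, here is a minimal sketch in plain Python (the real code operates on torch tensors; the helper names normalize and denormalize are made up for illustration). It only shows where 255 lands under each divisor and that the 256 round trip is lossless. It does not say which divisor the resize step should use.

```python
UINT8_MAX = 255


def normalize(value: int, divisor: int) -> float:
    # Hypothetical helper: scale an integer pixel value down to a float.
    return value / divisor


def denormalize(value: float, divisor: int) -> float:
    # Hypothetical helper: scale the float back up to the pixel range.
    return value * divisor


top_with_256 = normalize(UINT8_MAX, 256)  # slightly below 1
top_with_255 = normalize(UINT8_MAX, 255)  # exactly 1

# Dividing and multiplying by a power of two is exact in floating point,
# so every uint8 value survives the 256 round trip unchanged.
round_trip_ok = all(
    denormalize(normalize(v, 256), 256) == v for v in range(UINT8_MAX + 1)
)

print(top_with_256, top_with_255, round_trip_ok)
```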

Rocketknight1 · 2025-08-26T12:44:56Z

cc @yonigozlan @qubvel

qubvel

Thanks, makes sense 👍

HuggingFaceDocBuilderDev · 2025-08-26T13:50:30Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

yonigozlan · 2025-09-02T21:53:19Z

@remi-or I think you had used 256 on purpose? Can you check that this change isn't breaking?

remi-or · 2025-09-02T21:56:42Z

@yonigozlan I think I tested it against torchvision.resize and the parity test worked with 256. Maybe the snippet is in the PR?
edit: PR is here #38540

qubvel · 2025-09-02T22:09:37Z

src/transformers/image_processing_utils_fast.py

            image = torch.where(image > 255, 255, image)
            image = torch.where(image < 0, 0, image)


btw, it might be more optimal to use torch.clamp(image, 0, 255) once instead of torch.where twice

remi-or · Sep 2, 2025

Normally I would agree! But this is on purpose: #38540

qubvel · Sep 3, 2025

OK, according to that PR, it seems we have to revert this PR to use 256 and keep torch.where. To prevent this code from regressing again, we have to either add tests that fail on CI (cuda) if it is modified, or properly comment the code.

remi-or · Sep 3, 2025

The regression this PR introduced already caused failures on the AMD CI, which is as important as the NVIDIA (or cuda) CI!
As for properly commenting the code, both code paths where compile_friendly_resize is called are commented. You can check it out by expanding the diff; those lines are right above the function 🙂
If you want, we can add "# this is to match torchvision.resize" next to 256 and "# We use torch.where instead of torch.clamp to avoid an error with torch.compile" as comments, to make sure no one introduces the regression again. Wdyt?

qubvel · Sep 3, 2025

@remi-or absolutely agree, the AMD CI is as important as NVIDIA. What I'm trying to say is that we need a test that fails in the PR's CI, so that a PR like this one cannot be merged. In terms of comments, yeah, it's better to comment non-obvious code right in place. Otherwise it looks like a typo, and a comment located in a different part of the file is easy to miss (which is what happened in this case).
I'll do a quick fix for this, thanks for jumping in and clarifying 🤗
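For reference, the two clipping forms discussed in this thread can be compared in eager mode with a short sketch. The helper names clip_with_where and clip_with_clamp are hypothetical and do not exist in transformers. The in-code comment follows the wording proposed above. The sketch only checks that both forms give the same values on a small float tensor; it does not reproduce the torch.compile error mentioned in #38540.

```python
import torch


def clip_with_where(image: torch.Tensor) -> torch.Tensor:
    # Hypothetical helper mirroring the snippet quoted in this thread.
    # We use torch.where instead of torch.clamp to avoid an error with torch.compile
    image = torch.where(image > 255, 255, image)
    image = torch.where(image < 0, 0, image)
    return image


def clip_with_clamp(image: torch.Tensor) -> torch.Tensor:
    # Hypothetical helper: the single-call alternative suggested in review.
    return torch.clamp(image, 0, 255)


# Values below, inside, and above the uint8 range.
sample = torch.tensor([-3.5, 0.0, 127.25, 255.0, 300.0])
same_result = torch.equal(clip_with_where(sample), clip_with_clamp(sample))
print(same_result)
```

Both forms agree in eager mode on this input, which is why the torch.where version can look like an oversight unless the reason is written next to it.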

Fixed incorrect normalization

8fd1eed

qubvel approved these changes Aug 26, 2025

qubvel enabled auto-merge (squash) August 26, 2025 13:42

qubvel merged commit 5a8ba87 into huggingface:main Aug 26, 2025
24 checks passed

qubvel reviewed Sep 2, 2025

qubvel mentioned this pull request Sep 3, 2025

Revert change in compile_friendly_resize #40645

Merged

qubvel merged 1 commit into huggingface:main from audioXD:audioXD-patch-1
