[CLAP] Fix logit scales dtype for fp16 by sanchit-gandhi · Pull Request #25754 · huggingface/transformers

sanchit-gandhi · 2023-08-25T11:17:38Z

What does this PR do?

On some hardware, taking torch.log of a tensor in float16 on the CPU fails:

in __init__(self, config)
   1956         audio_config = config.audio_config
   1957 
-> 1958         self.logit_scale_a = nn.Parameter(torch.log(torch.tensor(config.logit_scale_init_value)))
   1959         self.logit_scale_t = nn.Parameter(torch.log(torch.tensor(config.logit_scale_init_value)))
   1960 

RuntimeError: "log_vml_cpu" not implemented for 'Half'

Note that this only failed for me on a Colab T4, but not on a Titan RTX (used to test #25682).

Let's take math.log then convert it to a tensor - this will respect the dtype of the model but not take torch.log of a float16 CPU param.

HuggingFaceDocBuilderDev · 2023-08-25T11:37:41Z

The documentation is not available anymore as the PR was closed or merged.

ArthurZucker

On CPU you can't use half anyway no?

sanchit-gandhi · 2023-08-25T12:29:59Z

Yep try with this:

import torch

torch.tensor(0).half().dtype()

Gives:

tensor(0., dtype=torch.float16)

Is used when we load diffusers pipelines in fp16 (load state dict in fp16 on cpu then move to cuda)

[CLAP] Fix logit scales dtype for fp16

d5332ac

sanchit-gandhi requested a review from ArthurZucker August 25, 2023 11:17

ArthurZucker approved these changes Aug 25, 2023

View reviewed changes

sanchit-gandhi merged commit 0770ce6 into huggingface:main Aug 25, 2023

sanchit-gandhi deleted the clap-dtype branch August 25, 2023 12:30

parambharat pushed a commit to parambharat/transformers that referenced this pull request Sep 26, 2023

[CLAP] Fix logit scales dtype for fp16 (huggingface#25754)

8bdeb54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[CLAP] Fix logit scales dtype for fp16#25754

[CLAP] Fix logit scales dtype for fp16#25754
sanchit-gandhi merged 1 commit intohuggingface:mainfrom
sanchit-gandhi:clap-dtype

sanchit-gandhi commented Aug 25, 2023 •

edited

Loading

Uh oh!

HuggingFaceDocBuilderDev commented Aug 25, 2023 •

edited

Loading

Uh oh!

ArthurZucker left a comment

Uh oh!

sanchit-gandhi commented Aug 25, 2023 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

sanchit-gandhi commented Aug 25, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Uh oh!

HuggingFaceDocBuilderDev commented Aug 25, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ArthurZucker left a comment

Choose a reason for hiding this comment

Uh oh!

sanchit-gandhi commented Aug 25, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

sanchit-gandhi commented Aug 25, 2023 •

edited

Loading

HuggingFaceDocBuilderDev commented Aug 25, 2023 •

edited

Loading

sanchit-gandhi commented Aug 25, 2023 •

edited

Loading