Set scale_embedding to False in some TF tests by ydshieh · Pull Request #15952 · huggingface/transformers

ydshieh · 2022-03-05T16:37:26Z

What does this PR do?

This PR set scale_embedding=False in TFSpeech2TextModelTester to avoid inputs_embeds and the PT/TF difference being scaled by 4 - the objective is to keep a low tolerance 1e-5 in the PT/TF test, as we see several times this is a strong safe guard!

(This is not a real bug. It's also similar to #15684, where we got larger differences between PT/TF simply because the model weights are initialized with larger values).

TF: @gante @Rocketknight1
Speech: @patrickvonplaten

More context

Set scale_embedding to False in some TF tests.

Current Speech2TextConfig has default scale_embedding=True. Therefore we have

self.embed_scale = tf.math.sqrt(float(embed_dim)).
The tests for speech_to_text has hidden_size=16, and therefore inputs_embeds will be scaled by 4.

transformers/src/transformers/models/speech_to_text/modeling_tf_speech_to_text.py

Lines 843 to 844 in 9932ee4

    
           inputs_embeds = self.conv(inputs["input_features"]) 
        
           inputs_embeds = self.embed_scale * inputs_embeds

Since inputs_embeds here is obtained by some (conv.) layer instead of via look-up table, it contains some tiny difference between PT/TF. This difference is scaled by 4 through self.embed_scale.

This makes TFSpeech2TextModel the only one model that will fail the aggressive PT/TF test introduced in #15839 (with low tolerance 1e-5). More precisely, the output tensors failed are encoder_hidden_states_0 and encoder_hidden_states_1.

Results

With this PR, the tolerance 1e-5 works for all TF models' test_pt_tf_model_equivalence (in #15839), both on GPU / CPU, tested 100 times per model.

…es between PT/TF

HuggingFaceDocBuilderDev · 2022-03-05T16:42:08Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.

set scale_embedding to False to avoid large (> 1e-5) output differenc…

42c2020

…es between PT/TF

ydshieh mentioned this pull request Mar 7, 2022

Make TF pt-tf equivalence test more aggressive #15839

Merged

gante approved these changes Mar 7, 2022

View reviewed changes

patrickvonplaten approved these changes Mar 7, 2022

View reviewed changes

A tiny fix: should be self.scale_embedding = scale_embedding

5057ce8

ydshieh merged commit 8b9ae45 into huggingface:master Mar 7, 2022

ydshieh deleted the set_scale_embedding_to_false_in_some_tf_tests branch March 7, 2022 21:15

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Set scale_embedding to False in some TF tests#15952

Set scale_embedding to False in some TF tests#15952
ydshieh merged 2 commits intohuggingface:masterfrom
ydshieh:set_scale_embedding_to_false_in_some_tf_tests

ydshieh commented Mar 5, 2022 •

edited

Loading

Uh oh!

HuggingFaceDocBuilderDev commented Mar 5, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

	inputs_embeds = self.conv(inputs["input_features"])
	inputs_embeds = self.embed_scale * inputs_embeds

Conversation

ydshieh commented Mar 5, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

More context

Results

Uh oh!

HuggingFaceDocBuilderDev commented Mar 5, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

ydshieh commented Mar 5, 2022 •

edited

Loading