
TF port of Convnextv2 #23155

Closed
IMvision12 wants to merge 7 commits into huggingface:main from IMvision12:convnextv2

Conversation

@IMvision12
Contributor

What does this PR do?

TF port of convnextv2

@amyeroberts

@IMvision12
Contributor Author

While converting the PyTorch weights to TensorFlow I am getting this error. How can I solve it?

All PyTorch model weights were used when initializing TFConvNextV2ForImageClassification.

All the weights of TFConvNextV2ForImageClassification were initialized from the PyTorch model.
If your task is similar to the task the model of the checkpoint was trained on, you can already use TFConvNextV2ForImageClassification for predictions without further training.
Traceback (most recent call last):
  File "/usr/local/bin/transformers-cli", line 8, in <module>
    sys.exit(main())
  File "/usr/local/lib/python3.10/dist-packages/transformers/commands/transformers_cli.py", line 55, in main
    service.run()
  File "/usr/local/lib/python3.10/dist-packages/transformers/commands/pt_to_tf.py", line 344, in run
    raise ValueError(
ValueError: The cross-loaded TensorFlow model has different outputs, something went wrong!

List of maximum output differences above the threshold (5e-05):
logits: 3.871e+00

List of maximum hidden layer differences above the threshold (5e-05):
hidden_states[1]: 3.463e-01
hidden_states[2]: 1.682e+00
hidden_states[3]: 2.259e+01
hidden_states[4]: 6.839e-01

Code used:

!transformers-cli pt-to-tf --model-name facebook/convnextv2-nano-1k-224 --no-pr --local-dir /content/convnextv2-nano-1k-224

@HuggingFaceDocBuilderDev

HuggingFaceDocBuilderDev commented May 4, 2023

The documentation is not available anymore as the PR was closed or merged.

Contributor

@amyeroberts left a comment


Thanks for adding this! Overall the PR looks good, just some small nits here and there.

Regarding the hidden layer differences, the way to solve it is to find the line(s) of code in the TF model contributing to the difference. The best thing to do is to bisect through the layers and their output activations when the equivalent PT and TF model are fed the same input.

In the output of the conversion script, we can see that the difference between the PyTorch and TensorFlow hidden states for the first block is already ~0.3, which is large. Since a large difference already appears in the first stage, I would load a small model with just one stage and start comparing the PT and TF models from there, e.g.:

from transformers import TFAutoModel, AutoModel

checkpoint = "facebook/convnextv2-tiny-1k-224"
pt_model = AutoModel.from_pretrained(checkpoint, num_stages=1)
tf_model = TFAutoModel.from_pretrained(checkpoint, from_pt=True, num_stages=1)
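Once both one-stage models are loaded, the bisection described above amounts to feeding the same input to each model and computing the maximum absolute difference between corresponding hidden states, exactly as the conversion script reports. A minimal sketch of that comparison step (using plain NumPy arrays to stand in for the converted PT/TF tensors; `find_first_divergence` is a hypothetical helper name, not part of `transformers`):

```python
import numpy as np

def max_abs_diff(pt_array, tf_array):
    """Maximum elementwise absolute difference between two activations."""
    return float(np.max(np.abs(pt_array - tf_array)))

def find_first_divergence(pt_hidden_states, tf_hidden_states, threshold=5e-5):
    """Report per-layer differences and return the index of the first
    hidden state exceeding the threshold, or None if they all match."""
    for i, (pt_h, tf_h) in enumerate(zip(pt_hidden_states, tf_hidden_states)):
        diff = max_abs_diff(pt_h, tf_h)
        print(f"hidden_states[{i}]: {diff:.3e}")
        if diff > threshold:
            return i
    return None
```

In practice you would run both models with `output_hidden_states=True`, convert the PT tensors with `.detach().numpy()` and the TF tensors with `.numpy()`, and then drill into the sub-layers (depthwise conv, GRN, layer norm) of the first layer whose index this returns.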

@IMvision12 IMvision12 closed this May 17, 2023
@IMvision12 IMvision12 deleted the convnextv2 branch May 17, 2023 20:10
