Conversation
- class BridgeTowerModelTester:
+ class BridgeTowerTextModelTester:
There is no BridgeTowerTextModelTest, however: we only use this tester class to create the text config and the text inputs.
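A minimal sketch of that pattern (illustrative names and values, not the exact tester code, and assuming BridgeTowerTextConfig is importable from the top-level transformers namespace): the class only produces a tiny text config and dummy text inputs for the composite model tests, and is never registered as a test case itself.

```python
import torch
from transformers import BridgeTowerTextConfig


class BridgeTowerTextModelTester:
    """Helper only: builds a tiny text config and dummy text inputs."""

    def __init__(self, parent, batch_size=1, seq_length=4, vocab_size=99):
        self.parent = parent
        self.batch_size = batch_size
        self.seq_length = seq_length
        self.vocab_size = vocab_size

    def get_config(self):
        # small values keep the tiny test model cheap to build and run
        return BridgeTowerTextConfig(
            vocab_size=self.vocab_size,
            hidden_size=128,
            num_hidden_layers=2,
            num_attention_heads=4,
            intermediate_size=256,
        )

    def prepare_config_and_inputs(self):
        input_ids = torch.randint(0, self.vocab_size, (self.batch_size, self.seq_length))
        attention_mask = torch.ones_like(input_ids)
        return self.get_config(), input_ids, attention_mask
```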
class BridgeTowerImageModelTester:
Same as mentioned for the text model tester above: this class only creates the vision config and image inputs.
hidden_size=128,
num_hidden_layers=2,
num_attention_heads=4,
intermediate_size=256,
This model requires some attributes to be defined in the top config (BridgeTowerConfig).
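A minimal sketch of what that can look like in the model tester, assuming BridgeTowerConfig accepts these keyword arguments (the names are taken from the diff above) and that the sub-testers are stored as `text_model_tester` / `image_model_tester` (hypothetical attribute names):

```python
from transformers import BridgeTowerConfig


def get_config(self):
    # the text/vision sub-configs come from the two tester classes above;
    # the same small attribute values are duplicated on the top-level config
    return BridgeTowerConfig(
        text_config=self.text_model_tester.get_config().to_dict(),
        vision_config=self.image_model_tester.get_config().to_dict(),
        hidden_size=128,
        num_hidden_layers=2,
        num_attention_heads=4,
        intermediate_size=256,
    )
```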
has_attentions = False
@unittest.skip(reason="Does not work on the tiny model as we keep hitting edge cases.")
def test_cpu_offload(self):
    pass

With the large version, this test passes.
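For reference, a hypothetical way to check offloading manually on the large model (not part of this PR; assumes the public `BridgeTower/bridgetower-large-itm-mlm` checkpoint and an environment with `accelerate` installed):

```python
from transformers import BridgeTowerModel

# device_map="auto" lets accelerate place weights on the GPU and
# offload whatever does not fit to CPU
model = BridgeTowerModel.from_pretrained(
    "BridgeTower/bridgetower-large-itm-mlm",
    device_map="auto",
)
```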
|
|
||
@unittest.skip(reason="Does not work on the tiny model as we keep hitting edge cases.")
def test_disk_offload(self):
    pass
@unittest.skip(reason="Does not work on the tiny model as we keep hitting edge cases.")
def test_model_parallelism(self):
    pass

With the large model, there is a device issue when running the forward pass. I tried to look into it, but constantly got GPU OOM, so I decided to update this test file. I will take a look at this test with a larger model (but not too large).
return config, inputs_dict

@slow
Remark: with a larger model (but not too large), we get FAILED tests/models/bridgetower/test_modeling_bridgetower.py::BridgeTowerModelTest::test_model_parallelism - RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cuda:1! Better to check this separately. Here is the full log:

>       new_output = new_model(**inputs_dict_class)
tests/test_modeling_common.py:2616:
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
/usr/local/lib/python3.8/dist-packages/torch/nn/modules/module.py:1501: in _call_impl
return forward_call(*args, **kwargs)
/usr/local/lib/python3.8/dist-packages/accelerate/hooks.py:165: in new_forward
output = old_forward(*args, **kwargs)
src/transformers/models/bridgetower/modeling_bridgetower.py:1423: in forward
image_embeds = self.vision_model.visual.transformer.resblocks[i](image_embeds).type(
/usr/local/lib/python3.8/dist-packages/torch/nn/modules/module.py:1501: in _call_impl
return forward_call(*args, **kwargs)
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
self = BridgeTowerResidualAttention(
(attn): MultiheadAttention(
(out_proj): NonDynamicallyQuantizableLinear(in_feature...ar(in_features=2048, out_features=512, bias=True)
)
(ln_2): LayerNorm((512,), eps=1e-05, elementwise_affine=True)
)
hidden_state = tensor([[[ 0.5531, 0.0555, -0.0248, ..., 0.2110, -0.0403, 0.0487]],
[[ 0.2963, -0.1709, 0.0074, ..., 0... [[ 0.3324, -0.0536, -0.0069, ..., 0.0911, -0.0565, -0.2751]]],
device='cuda:1', grad_fn=<ViewBackward0>)
attention_mask = None
def forward(self, hidden_state: torch.Tensor, attention_mask: torch.Tensor = None):
residual_state = hidden_state + self.attention(self.ln_1(hidden_state), attention_mask)
hidden_state = self.ln_2(residual_state)
for _, layer in self.mlp.items():
hidden_state = layer(hidden_state)
> hidden_state = residual_state + hidden_state
E RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cuda:1!
src/transformers/models/bridgetower/modeling_bridgetower.py:237: RuntimeError
================================================================================================== warnings summary ==================================================================================================
../usr/local/lib/python3.8/dist-packages/detectron2/data/transforms/transform.py:46
/usr/local/lib/python3.8/dist-packages/detectron2/data/transforms/transform.py:46: DeprecationWarning: LINEAR is deprecated and will be removed in Pillow 10 (2023-07-01). Use BILINEAR or Resampling.BILINEAR instead.
def __init__(self, src_rect, output_size, interp=Image.LINEAR, fill=0):
-- Docs: https://docs.pytest.org/en/stable/how-to/capture-warnings.html
============================================================================================== short test summary info ===============================================================================================
FAILED tests/models/bridgetower/test_modeling_bridgetower.py::BridgeTowerModelTest::test_model_parallelism - RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cuda:1!
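Judging from the traceback, the add at modeling_bridgetower.py:237 combines `residual_state` (left on cuda:0) with the MLP output (placed on cuda:1 by the accelerate hooks). A hedged sketch of one possible fix, not a change made in this PR: align the devices before the residual add. The submodule names (`attention`, `ln_1`, `ln_2`, `mlp`) are taken from the traceback above.

```python
import torch

# Inside BridgeTowerResidualAttention (sketch of the method shown in the log):
def forward(self, hidden_state: torch.Tensor, attention_mask: torch.Tensor = None):
    residual_state = hidden_state + self.attention(self.ln_1(hidden_state), attention_mask)
    hidden_state = self.ln_2(residual_state)
    for _, layer in self.mlp.items():
        hidden_state = layer(hidden_state)
    # the MLP may live on a different GPU under naive model parallelism, so move
    # the residual to the device of the MLP output before adding
    hidden_state = residual_state.to(hidden_state.device) + hidden_state
    return hidden_state
```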
* update

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
What does this PR do?

Update BridgeTowerModelTester to use small values for config.
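To verify the change locally, the updated test file can be run directly (path taken from the logs above); a minimal sketch:

```python
import pytest

# equivalent to: pytest tests/models/bridgetower/test_modeling_bridgetower.py -v
pytest.main(["tests/models/bridgetower/test_modeling_bridgetower.py", "-v"])
```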