
Add DPT #15991

Merged

NielsRogge merged 55 commits into huggingface:main from NielsRogge:add_dpt_redesign on Mar 28, 2022

Conversation

@NielsRogge (Contributor) commented Mar 8, 2022

What does this PR do?

This PR adds DPT, Dense Prediction Transformers, to the library. It's some very nice work from Intel Labs that applies Transformers to dense prediction tasks such as semantic segmentation and depth estimation.

Feel free to play around with the notebook here.

I've defined 3 models:

  • DPTModel
  • DPTForDepthEstimation
  • DPTForSemanticSegmentation

DPTModel is the backbone only (ViT in this case). The head models use a neck (DPTNeck) combined with a task-specific head (either DPTDepthEstimationHead or DPTSemanticSegmentationHead).

Important here:

  • a neck is an nn.Module that takes a list of tensors and produces another list of tensors.
  • a head takes a list of tensors and returns logits.
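The neck/head contract above can be sketched with toy modules. ToyNeck, ToyHead, and all channel sizes below are made up for illustration; they are not the PR's actual classes:

```python
import torch
from torch import nn


class ToyNeck(nn.Module):
    """Hypothetical neck: list of feature maps in, list of feature maps out."""

    def __init__(self, channels):
        super().__init__()
        # one projection per incoming feature map
        self.convs = nn.ModuleList(nn.Conv2d(c, 64, kernel_size=1) for c in channels)

    def forward(self, hidden_states):
        return [conv(x) for conv, x in zip(self.convs, hidden_states)]


class ToyHead(nn.Module):
    """Hypothetical head: list of feature maps in, logits out."""

    def __init__(self, num_labels):
        super().__init__()
        self.classifier = nn.Conv2d(64, num_labels, kernel_size=1)

    def forward(self, hidden_states):
        # consume the last feature map produced by the neck
        return self.classifier(hidden_states[-1])


# two backbone feature maps with different channel counts
features = [torch.randn(1, c, 16, 16) for c in (96, 192)]
logits = ToyHead(num_labels=10)(ToyNeck(channels=(96, 192))(features))
print(logits.shape)  # torch.Size([1, 10, 16, 16])
```

The point of the split is that any backbone producing a list of feature maps can be combined with any task head through the same list-in/list-out neck interface.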

To do:

  • make sure heads take a list of tensors as input
  • add tests for DPTFeatureExtractor
  • discuss out_indices and in_index names
  • transfer weights to Intel organization

HuggingFaceDocBuilderDev commented Mar 8, 2022

The documentation is not available anymore as the PR was closed or merged.

@sgugger (Collaborator) left a comment

Thanks for your PR!

There are still a lot of things to clean up in the modeling file. Make sure to follow the usual style guidelines we have in place.

Contributor

Thanks for working on it, it looks like a very cool model. There are some naming conventions that must be applied:

  • ***Block/s -> ***Layer/s
  • use Stage when we stack multiple ***Layers together
  • when possible use hidden_size/s instead of channels

Some code could use some refactoring to increase readability.

Comment on lines +499 to +522
        self.use_batch_norm = config.use_batch_norm
        self.act1 = nn.ReLU()
        self.conv1 = nn.Conv2d(
            config.channels,
            config.channels,
            kernel_size=3,
            stride=1,
            padding=1,
            bias=not self.use_batch_norm,
        )

        self.act2 = nn.ReLU()
        self.conv2 = nn.Conv2d(
            config.channels,
            config.channels,
            kernel_size=3,
            stride=1,
            padding=1,
            bias=not self.use_batch_norm,
        )

        if self.use_batch_norm:
            self.batch_norm1 = nn.BatchNorm2d(config.channels)
            self.batch_norm2 = nn.BatchNorm2d(config.channels)


That could work. Keep in mind I am not sure whether we always prefer named layers over unnamed ones (such as stacking them in a sequential layer), so take my design comments with a grain of salt.
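To illustrate the alternative hinted at here, a small sketch (arbitrary channel count, not the PR's actual code) showing that the named-layer pattern and an nn.Sequential stack of the very same modules compute the same thing:

```python
import torch
from torch import nn

channels, use_batch_norm = 8, True

# named sub-modules, mirroring the pattern in the snippet under review
act1 = nn.ReLU()
conv1 = nn.Conv2d(
    channels, channels, kernel_size=3, stride=1, padding=1, bias=not use_batch_norm
)
batch_norm1 = nn.BatchNorm2d(channels)
batch_norm1.eval()  # use running stats so the comparison is deterministic

# the stacked alternative: same modules, composed in the same order
unit = nn.Sequential(act1, conv1, batch_norm1)

x = torch.randn(2, channels, 4, 4)
named_out = batch_norm1(conv1(act1(x)))
stacked_out = unit(x)
print(torch.allclose(named_out, stacked_out))  # True
```

The trade-off is access vs. brevity: named attributes make it easy to reference individual layers (e.g. for the conditional batch-norm branch), while nn.Sequential keeps the forward pass shorter.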

Comment on lines +660 to +688
        self.embeddings = DPTViTEmbeddings(config)
        self.encoder = DPTViTEncoder(config)


Okay, why is it not called DPTViTModel then?

Contributor

Thanks! A couple of comments about individual variables and naming.

@NielsRogge (Contributor, Author) commented Mar 15, 2022

Thanks for your reviews, I've addressed most comments. The main things to update are:

  • rename out_indices (which features to use from the backbone)
  • rename in_index (which features to use in the head)

Ideally, we pick names that will be used by all vision models.
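For context on these two parameters, a minimal sketch (layer names and index values below are made up for illustration) of how out_indices and in_index select features at different stages of the pipeline:

```python
# stand-ins for the feature maps produced by a 12-layer encoder
hidden_states = [f"layer_{i}" for i in range(12)]

out_indices = (2, 5, 8, 11)  # which encoder layers the backbone exposes to the neck
in_index = -1                # which of those features the head consumes

backbone_features = [hidden_states[i] for i in out_indices]
head_input = backbone_features[in_index]
print(backbone_features)  # ['layer_2', 'layer_5', 'layer_8', 'layer_11']
print(head_input)         # 'layer_11'
```

So out_indices is a backbone-level choice and in_index a head-level one, which is why a shared naming scheme across vision models matters.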

Contributor

Thanks! Approved; there are still some minor comments about variable names.

@sgugger (Collaborator) left a comment

There are still a few comments pending from the first review; I have highlighted them.

            The number of output channels for each of the four feature maps of the backbone.
        channels (`int`, *optional*, defaults to 256):
            The number of channels before fusion.
        in_index (`int`, *optional*, defaults to -1):
Collaborator

I think you agreed to use head_in_indices here?

Collaborator

This has not been addressed.

Collaborator

Same for this one.

NielsRogge merged commit 979b039 into huggingface:main on Mar 28, 2022
debtriche commented Oct 11, 2022 via email

6 participants