Fix Dropout Implementation in Graphormer #24817
Conversation
This commit corrects the dropout implementation in Graphormer, aligning it with the original implementation and improving performance. Specifically: 1. The `attention_dropout` variable, intended for use in GraphormerMultiheadAttention, was defined but not used. This has been corrected to use `attention_dropout` instead of the regular `dropout`. 2. The `activation_dropout` for the activations in the feed-forward layers was missing. Instead, the regular `dropout` was used. This commit adds `activation_dropout` to the feed-forward layers. These changes ensure the dropout implementation matches the original Graphormer and delivers empirically better performance.
Hi @clefourrier and others,

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread. Please note that issues that do not follow the contributing guidelines are likely to be ignored.
It still needs to be addressed! @clefourrier or whoever is responsible for it, am I doing something wrong with my pull request, or what is taking so long for anyone to answer?! How am I supposed to contribute if I am being ignored?
Hi @alexanderkrauck ! |
clefourrier
left a comment
LGTM, thank you for this fix
What does this PR do?
This commit corrects the dropout implementation in Graphormer, aligning it with the original implementation (https://github.com/microsoft/Graphormer) and improving performance. Specifically:
1. The `attention_dropout` variable, intended for use in GraphormerMultiheadAttention, was defined but not used. This has been corrected to use `attention_dropout` instead of the regular `dropout`.
2. The `activation_dropout` for the activations in the feed-forward layers was missing. Instead, the regular `dropout` was used. This commit adds `activation_dropout` to the feed-forward layers and to the GraphormerConfig, including documentation.

These changes ensure the dropout implementation matches the original Graphormer and delivers empirically better performance.
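The two fixes can be sketched as follows. This is a minimal, illustrative encoder-layer fragment, not the actual Graphormer code: the class name, field names, and default values are hypothetical, and layer norms and other details of the real model are omitted.

```python
import torch
import torch.nn as nn


class GraphormerEncoderLayerSketch(nn.Module):
    """Illustrative sketch of the dropout fix; names and defaults are hypothetical."""

    def __init__(self, embedding_dim=768, ffn_dim=3072, num_heads=8,
                 dropout=0.1, attention_dropout=0.1, activation_dropout=0.1):
        super().__init__()
        # Fix 1: the attention module now receives `attention_dropout`
        # (previously the regular `dropout` was passed here).
        self.self_attn = nn.MultiheadAttention(
            embedding_dim, num_heads, dropout=attention_dropout
        )
        self.fc1 = nn.Linear(embedding_dim, ffn_dim)
        self.fc2 = nn.Linear(ffn_dim, embedding_dim)
        self.dropout = nn.Dropout(dropout)
        # Fix 2: a separate dropout for the feed-forward activations
        # (previously missing; the regular `dropout` was used instead).
        self.activation_dropout = nn.Dropout(activation_dropout)

    def forward(self, x):
        # x: (seq_len, batch, embedding_dim)
        attn_out, _ = self.self_attn(x, x, x)
        x = x + self.dropout(attn_out)
        h = torch.relu(self.fc1(x))
        h = self.activation_dropout(h)  # applied to the FFN activation
        x = x + self.dropout(self.fc2(h))
        return x
```

Keeping the three rates as separate config fields lets each dropout be tuned independently, matching the behavior of the original Microsoft Graphormer implementation.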
Fixes # (issue)
Before submitting

- [ ] Did you read the contributor guideline, Pull Request section?
- [ ] Was this discussed/approved via a Github issue or the forum? Please add a link to it if that's the case.
- [ ] Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
- [ ] Did you write any new necessary tests?
Who can review?
Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.
@clefourrier