DiscoCLIP loss #98

Merged
karinazad merged 13 commits into main from k/ume-contrastive-learnable-temperature
Jun 9, 2025

@karinazad
Collaborator

@karinazad karinazad commented Jun 9, 2025

Training Ume medium and large with the contrastive objective is currently not possible because of CUDA OOM.

This MR implements the DisCoCLIP loss, which is intended to be more memory-efficient in distributed training.

See https://arxiv.org/pdf/2304.08480
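
To illustrate where the memory saving comes from, here is a minimal pure-Python sketch, not the code in this MR. It assumes a one-directional InfoNCE loss and simulates ranks with a loop; the names `info_nce_rows`, `full_loss`, and `sharded_loss` are hypothetical. It only shows that the forward loss splits row-wise, so each rank can score its own rows against the gathered keys and hold a (B / N) x B slice of logits instead of the full B x B matrix. The gradient exchange between ranks is not modelled here.

```python
import math


def info_nce_rows(queries, keys, targets, temperature=0.07):
    """Summed cross-entropy of each query scored against all keys.

    targets[i] is the index in `keys` of the positive for queries[i].
    """
    total = 0.0
    for q, t in zip(queries, targets):
        logits = [sum(x * y for x, y in zip(q, k)) / temperature for k in keys]
        m = max(logits)  # subtract the max for a stable log-sum-exp
        log_z = m + math.log(sum(math.exp(v - m) for v in logits))
        total += log_z - logits[t]
    return total


def full_loss(a, b, temperature=0.07):
    """Reference: a single process scores all B rows (B x B logits)."""
    n = len(a)
    return info_nce_rows(a, b, list(range(n)), temperature) / n


def sharded_loss(a, b, world_size, temperature=0.07):
    """Each simulated rank scores only its own rows against all keys."""
    n = len(a)
    per_rank = n // world_size  # assumes B is divisible by world_size
    total = 0.0
    for rank in range(world_size):
        lo, hi = rank * per_rank, (rank + 1) * per_rank
        # This rank only materialises a (B / N) x B slice of logits.
        total += info_nce_rows(a[lo:hi], b, list(range(lo, hi)), temperature)
    # The running sum stands in for an all-reduce across ranks.
    return total / n
```

With any two equally sized lists of embedding vectors `a` and `b`, `sharded_loss(a, b, world_size=4)` matches `full_loss(a, b)` up to floating-point rounding, while no single rank ever builds the full logit matrix.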

@karinazad karinazad changed the title from "DiscoCLIP loss + learnable contrastive temperature" to "DiscoCLIP loss" Jun 9, 2025
@karinazad karinazad requested a review from ncfrey June 9, 2025 17:59
@@ -0,0 +1,6 @@
_target_: lightning.pytorch.callbacks.BatchSizeFinder
Contributor

is this working now?

Collaborator Author

Yes, on a single GPU it seems to be working.

}


def is_distributed() -> bool:
Contributor

can we call this module _distributed_utils.py for clarity?
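
For readers without the diff at hand: the body of `is_distributed()` is not shown in this conversation. One common way to write such a check, sketched here under the assumption of a torchrun-style launcher that exports `WORLD_SIZE`, is below. The name `is_distributed_env` and its `environ` parameter are hypothetical, and the helper in this MR may instead query `torch.distributed`.

```python
import os


def is_distributed_env(environ=None) -> bool:
    """Return True when launcher variables indicate more than one process.

    Assumption: the launcher exports WORLD_SIZE, as torchrun does.
    """
    env = os.environ if environ is None else environ
    try:
        return int(env.get("WORLD_SIZE", "1")) > 1
    except ValueError:
        # A malformed value is treated as a single-process run.
        return False
```

Passing a plain dict, for example `is_distributed_env({"WORLD_SIZE": "4"})`, makes the check easy to exercise in unit tests without touching the real environment.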

@karinazad karinazad merged commit 17c0773 into main Jun 9, 2025
5 checks passed
@karinazad karinazad deleted the k/ume-contrastive-learnable-temperature branch June 9, 2025 18:53
2 participants