comparing fa2 by ncfrey · Pull Request #112 · prescient-design/lobster

ncfrey · 2025-06-17T17:50:07Z

No description provided.

Copilot

Pull Request Overview

This PR refines the attention and padding configuration for FA2 (flash vs SDPA attention) and adds helper scripts for comparing embedding consistency across modes.

Enhanced type safety with Literal for attention_layer and padding in FlexBertConfig
Unified padding/attention-mask logic in Ume.__init__ and new Ume.load_from_checkpoint
Added example and test scripts to validate SDPA vs flash-attention embedding consistency
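
To illustrate the `Literal` change listed above, here is a minimal sketch of a config whose `attention_layer` and `padding` fields are restricted to a fixed set of strings. `SketchConfig` and the allowed values are assumptions chosen for illustration; the actual options accepted by `FlexBertConfig` may differ.

```python
from dataclasses import dataclass
from typing import Literal, get_args

# ASSUMPTION: illustrative value sets, not the real FlexBertConfig options.
AttentionLayer = Literal["base", "rope"]
Padding = Literal["padded", "unpadded"]


@dataclass
class SketchConfig:
    """Hypothetical stand-in for a config with Literal-typed fields."""

    attention_layer: AttentionLayer = "rope"
    padding: Padding = "unpadded"

    def __post_init__(self) -> None:
        # Literal hints are enforced only by static type checkers,
        # so the same value sets are re-checked at runtime here.
        if self.attention_layer not in get_args(AttentionLayer):
            raise ValueError(f"Unknown attention_layer: {self.attention_layer!r}")
        if self.padding not in get_args(Padding):
            raise ValueError(f"Unknown padding: {self.padding!r}")
```

Reusing `get_args` keeps the runtime check in sync with the type hint, so adding a new option only requires editing the `Literal` alias.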

Reviewed Changes

Copilot reviewed 8 out of 8 changed files in this pull request and generated 1 comment.

Show a summary per file

File	Description
src/lobster/model/modern_bert/_config.py	Added `Literal` type hints for `attention_layer` and `padding` parameters
src/lobster/model/_ume.py	Refactored padding/mask logic; introduced `load_from_checkpoint` method
src/lobster/model/README.md	Documented UME model and its dynamic checkpoint loader
slurm/scripts/test_fa2_ume.sh	New SLURM submission script for SDPA vs flash‐attention test
pyproject.toml	Pinned `triton<=3.1.0`; excluded `examples` from linting
examples/test_sdpa_unpadded_attention_mask.py	New test script to inspect attention masks
examples/test_embed_fa2.py	Simple embedding example for flash-attention
examples/compare_flash_attention_embeddings.py	Comprehensive script comparing embeddings across modes
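
The comparison script itself is not shown in this conversation. As a rough sketch of what an embedding consistency check between SDPA and flash-attention might look like, the helpers below compare two embedding vectors by maximum absolute difference and cosine similarity. The function names and tolerances are assumptions, and plain Python lists stand in for tensors.

```python
import math
from typing import Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


def max_abs_diff(a: Sequence[float], b: Sequence[float]) -> float:
    """Largest element-wise absolute difference."""
    return max(abs(x - y) for x, y in zip(a, b))


def embeddings_match(
    a: Sequence[float],
    b: Sequence[float],
    atol: float = 1e-3,        # ASSUMPTION: illustrative tolerance
    min_cosine: float = 0.999,  # ASSUMPTION: illustrative threshold
) -> bool:
    """True when two embeddings agree within both tolerances."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same length")
    return max_abs_diff(a, b) <= atol and cosine_similarity(a, b) >= min_cosine
```

Checking both metrics is a deliberate choice in this sketch: cosine similarity is insensitive to a uniform scale change, while the absolute difference catches it.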

Comments suppressed due to low confidence (4)

src/lobster/model/_ume.py:158

[nitpick] Configuration logic for padding and masks is duplicated between __init__ and load_from_checkpoint. Consider extracting this into a shared helper to adhere to DRY principles.

        if ckpt_path is not None:
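
One way to act on this nitpick is to move the padding and mask decision into a single function that both `__init__` and `load_from_checkpoint` call. The sketch below is hypothetical: `resolve_attention_setup`, `AttentionSetup`, and the mapping from the flash-attention flag to `padding` and `use_sdpa_attn_mask` are assumptions, not the logic in `_ume.py`.

```python
from typing import Literal, NamedTuple


class AttentionSetup(NamedTuple):
    """Hypothetical container for the settings both code paths need."""

    padding: Literal["padded", "unpadded"]
    use_sdpa_attn_mask: bool


def resolve_attention_setup(use_flash_attn: bool) -> AttentionSetup:
    """Single source of truth for padding and mask settings.

    ASSUMPTION: flash-attention runs on unpadded inputs without an SDPA
    mask, and the SDPA path uses padded inputs with an attention mask.
    The real mapping in the PR may differ.
    """
    if use_flash_attn:
        return AttentionSetup(padding="unpadded", use_sdpa_attn_mask=False)
    return AttentionSetup(padding="padded", use_sdpa_attn_mask=True)
```

With a helper like this, each caller would unpack the result instead of repeating the branching, so a future change to the mapping happens in one place.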

src/lobster/model/_ume.py:791

Restricting device to exactly "cpu" or "cuda" excludes valid strings like "cuda:0" or custom devices. Consider accepting a torch.device object or allowing any string that starts with "cuda" for greater flexibility.

            if device not in ["cpu", "cuda"]:
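
A minimal sketch of a more permissive check, assuming only `"cpu"`, `"cuda"`, and indexed forms such as `"cuda:0"` need to be accepted. `is_supported_device` is a hypothetical helper written without a PyTorch dependency; in the real code, passing the string through `torch.device` would also handle parsing.

```python
import re

# Matches "cuda" or "cuda:<index>", e.g. "cuda:0" or "cuda:12".
_CUDA_PATTERN = re.compile(r"cuda(:\d+)?")


def is_supported_device(device: str) -> bool:
    """Accept "cpu", "cuda", and indexed CUDA devices like "cuda:0"."""
    return device == "cpu" or _CUDA_PATTERN.fullmatch(device) is not None
```

Using `fullmatch` rather than a `startswith("cuda")` test rejects malformed strings such as `"cuda:"` or `"cudaX"`.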

src/lobster/model/_ume.py:752

The new load_from_checkpoint method isn’t covered by existing unit tests. Adding targeted tests for CPU vs GPU loading, verifying padding and use_sdpa_attn_mask settings, would help prevent regressions.

    def load_from_checkpoint(

src/lobster/model/README.md:3

[nitpick] The link reference to _ume.py may not resolve correctly in rendered markdown. Consider linking to the Ume class documentation or using a direct path to the file.

## [Universal Molecular Encoder](_ume.py)

examples/test_sdpa_unpadded_attention_mask.py

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

karinazad · 2025-06-17T17:56:09Z

examples/test_embed_fa2.py

let's rename test_ to something else so it's not accidentally picked up by pytest
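
For context, pytest's default `python_files` setting collects files matching `test_*.py` and `*_test.py`. The sketch below mimics that filename check so a proposed name can be tested before renaming. `collected_by_default` is a hypothetical helper, and it ignores project-level overrides such as `testpaths` or a custom `python_files`.

```python
from fnmatch import fnmatchcase

# pytest's default `python_files` patterns.
DEFAULT_PYTEST_PATTERNS = ("test_*.py", "*_test.py")


def collected_by_default(path: str) -> bool:
    """True if the file name matches pytest's default collection patterns."""
    filename = path.rsplit("/", 1)[-1]
    return any(fnmatchcase(filename, pattern) for pattern in DEFAULT_PYTEST_PATTERNS)
```

For example, `collected_by_default("examples/test_embed_fa2.py")` is true, while a name without the `test_` prefix or `_test` suffix would not match.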

karinazad · 2025-06-17T17:56:36Z

examples/test_sdpa_unpadded_attention_mask.py

same as above

karinazad · 2025-06-17T18:01:03Z

slurm/scripts/test_fa2_ume.sh

same as here

karinazad · 2025-06-17T18:27:35Z

src/lobster/model/_ume.py

+        """
+        # Determine device
+        if device is not None:
+            if device not in ["cpu", "cuda"]:


device could be something like "cuda:0" ?

freyn6 added 2 commits June 12, 2025 21:42

scripts

4488374

more fa2 configs

165f35b

ncfrey requested a review from Copilot June 17, 2025 17:50

ncfrey temporarily deployed to test.pypi.org June 17, 2025 17:50 — with GitHub Actions Inactive

ncfrey requested a review from karinazad June 17, 2025 17:50

Copilot AI reviewed Jun 17, 2025

examples/test_sdpa_unpadded_attention_mask.py (outdated)

Update examples/test_sdpa_unpadded_attention_mask.py

fab4b38

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

ncfrey had a problem deploying to test.pypi.org June 17, 2025 17:53 — with GitHub Actions Failure

Merge branch 'main' into n/comparing-fa2

8be21de

ncfrey temporarily deployed to test.pypi.org June 17, 2025 17:54 — with GitHub Actions Inactive

karinazad reviewed Jun 17, 2025

update optional deps

02acd82

karinazad reviewed Jun 17, 2025

examples/test_sdpa_unpadded_attention_mask.py

same as above

ncfrey reacted with thumbs up emoji

ncfrey temporarily deployed to test.pypi.org June 17, 2025 17:56 — with GitHub Actions Inactive

rename

eab39a1

ncfrey temporarily deployed to test.pypi.org June 17, 2025 18:00 — with GitHub Actions Inactive

karinazad reviewed Jun 17, 2025

slurm/scripts/test_fa2_ume.sh

same as here

karinazad reviewed Jun 17, 2025

karinazad approved these changes Jun 17, 2025

ncfrey merged commit fd2fbe2 into main Jun 17, 2025
5 checks passed

ncfrey deleted the n/comparing-fa2 branch June 17, 2025 21:57

ncfrey merged 6 commits into main from n/comparing-fa2

3 participants
