test: Add script to test model loading below n_parameters threshold by isaac-chung · Pull Request #1698 · embeddings-benchmark/mteb

isaac-chung · 2025-01-03T08:56:23Z

First, get the model meta from the model registry, check n_parameters, and omit API models.
If n_parameters is below the threshold (2B for now), run mteb.get_model and record the exception (None or the raw text).
Write these results to a results.json file in the repo (in the scripts folder).
Support resuming from where the script left off (args to run only missing models, only unsuccessful models, or specific model names).
Trigger the script when model files change, then write the results to the JSON file.
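
The steps above can be sketched roughly as follows. This is a minimal illustration, not the script from this PR: `ModelInfo`, `should_test`, `try_load`, `run`, and the `mode` values are hypothetical names, and the registry entry is reduced to three fields. In the real script the `loader` argument would presumably be `mteb.get_model`.

```python
import json
from dataclasses import dataclass
from pathlib import Path
from typing import Callable, Iterable, Optional

PARAM_THRESHOLD = 2_000_000_000  # 2B, as stated in the description


@dataclass
class ModelInfo:
    """Hypothetical stand-in for a registry entry (the real meta has more fields)."""
    name: str
    n_parameters: Optional[int]  # None is treated here as "size unknown"
    is_api: bool = False


def should_test(info: ModelInfo, threshold: int = PARAM_THRESHOLD) -> bool:
    """Keep only non-API models with a known size below the threshold."""
    if info.is_api or info.n_parameters is None:
        return False
    return info.n_parameters < threshold


def try_load(name: str, loader: Callable[[str], object]) -> Optional[str]:
    """Return None on success, otherwise the raw exception text."""
    try:
        loader(name)
        return None
    except Exception as exc:  # broad on purpose: every failure is recorded
        return str(exc)


def run(models: Iterable[ModelInfo], loader: Callable[[str], object],
        results_path: Path, mode: str = "missing") -> dict:
    """Load models and persist outcomes so a later run can resume.

    mode="missing" skips models already present in the results file;
    mode="failed" retries only models whose recorded value is not None.
    """
    results = json.loads(results_path.read_text()) if results_path.exists() else {}
    for info in models:
        if not should_test(info):
            continue
        if mode == "missing" and info.name in results:
            continue
        if mode == "failed" and results.get(info.name) is None:
            continue
        results[info.name] = try_load(info.name, loader)
        # Write after every model so an interrupted run keeps its progress.
        results_path.write_text(json.dumps(results, indent=2))
    return results
```

Writing the file after each model, rather than once at the end, is what makes resuming possible if the run is interrupted partway through.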

Checklist

Run tests locally to make sure nothing is broken using make test.
Run the formatter to format the code using make lint.

Samoed · 2025-01-03T09:19:44Z

I think we could create a file to record whether a model load was successful, to avoid repeated loading attempts.

isaac-chung · 2025-01-03T10:15:34Z

Right now the main issue is the disk running out of space. I'm looking into how to avoid saving the weights to the cache folder when loading a model.
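
One generic way to keep disk usage bounded is to give each load its own throwaway cache directory and delete it afterwards. The sketch below assumes a loader that accepts a cache directory argument; that signature and the name `load_with_temp_cache` are hypothetical, and this is not necessarily the approach the PR ended up using.

```python
import tempfile
from typing import Callable, Optional


def load_with_temp_cache(name: str,
                         loader: Callable[[str, str], object]) -> Optional[str]:
    """Call loader(name, cache_dir) with a temporary cache directory.

    Returns None on success, otherwise the raw exception text.
    The directory and everything written into it are removed when the
    `with` block exits, whether or not the load succeeded.
    """
    with tempfile.TemporaryDirectory(prefix="model_cache_") as cache_dir:
        try:
            loader(name, cache_dir)
            return None
        except Exception as exc:  # record any failure as text
            return str(exc)
```

The trade-off is that nothing is reused between runs, so every model is downloaded again each time it is tested.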

Samoed · 2025-01-04T08:00:57Z

Also, the test should skip API-based models.

KennethEnevoldsen

Looking good so far. Just to clarify: is the intention to move this into the test suite?

isaac-chung · 2025-01-06T16:31:47Z

@KennethEnevoldsen thanks, and yes.

isaac-chung · 2025-01-06T22:19:52Z

@KennethEnevoldsen @Samoed just pushed the latest changes, and updated the description. Would you mind reviewing and seeing if anything is unclear?

I'll try to push a "test" commit to see how it works.

KennethEnevoldsen

Generally looks good, very happy to accept this with just the changes to the dependency installations.

isaac-chung · 2025-01-07T12:16:15Z

Thanks both for your input. Right now the test fails at the merge base step (empty list is returned). If you have some time, it would be great if you could help take a quick look there. Otherwise I plan to return to this later tomorrow.
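
For context, detecting changed model files against a merge base can be sketched as below. This is an illustration under assumptions, not the workflow from this PR: the `mteb/models` directory, the function names, and the choice to skip `__init__.py` are all hypothetical. The three-dot form `git diff base...HEAD` compares HEAD with the merge base of the two refs. One common reason for a merge-base lookup to come back empty in CI is a shallow checkout that lacks the shared history; whether that was the cause here is not stated.

```python
import subprocess
from pathlib import PurePosixPath
from typing import Iterable, List

MODELS_DIR = "mteb/models"  # assumed location of the model definition files


def changed_files(base_ref: str, repo_path: str = ".") -> List[str]:
    """List files changed between the merge base of base_ref and HEAD."""
    out = subprocess.run(
        ["git", "-C", repo_path, "diff", "--name-only", f"{base_ref}...HEAD"],
        check=True, capture_output=True, text=True,
    ).stdout
    return [line for line in out.splitlines() if line]


def model_files(paths: Iterable[str], models_dir: str = MODELS_DIR) -> List[str]:
    """Keep only Python files under models_dir, ignoring __init__.py."""
    root = PurePosixPath(models_dir)
    keep = []
    for p in paths:
        path = PurePosixPath(p)
        if (path.suffix == ".py"
                and root in path.parents
                and path.name != "__init__.py"):
            keep.append(p)
    return keep
```

A caller would combine the two, for example `model_files(changed_files("origin/main"))`, and pass the result to the loading script.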

Samoed · 2025-01-07T15:53:36Z

I think the GitHub runner automatically merges branches during CI, so you don’t need to do it manually.

KennethEnevoldsen

Great work! I think we are almost there.

Samoed

Great work!

isaac-chung · 2025-01-09T15:42:32Z

Thanks again. Merging now. Note that the model loading "test" will likely require merging the target branch first to get accurate diffs.

isaac-chung added 2 commits January 3, 2025 08:54

add model loading test for models below 2B params

76d2e96

add failure message to include model namne

a9d0c44

isaac-chung marked this pull request as draft January 3, 2025 08:56

use the real get_model_meta

766aad2

isaac-chung added 3 commits January 3, 2025 10:30

use cache folder

230d4f2

teardown per function

99abdb5

fix directory removal

0cbdaa0

isaac-chung mentioned this pull request Jan 3, 2025

Unittests to check model loading? #1690

Closed

isaac-chung added 5 commits January 4, 2025 13:36

write to file

59cc65b

wip loading from before

ea1d21f

wip

129e8cc

Rename model_loading_testing.py to model_loading.py

8fbb48f

Delete tests/test_models/test_model_loading.py

fb95ee7

Samoed reviewed Jan 4, 2025

Comment thread scripts/failures.json Outdated

isaac-chung added 2 commits January 5, 2025 09:36

checks for models below 2B

41c4b5c

try not using cache folder

9af61d0

isaac-chung changed the title ~~test: Add model load test below n param threshold~~ misc: Add script to test model loading below n_parameters threshold Jan 6, 2025

KennethEnevoldsen reviewed Jan 6, 2025

isaac-chung added 3 commits January 6, 2025 21:22

update script with scan_cache_dir and add args

b8777d1

add github CI: detect changed model files and run model loading test

bd56f86

install all model dependencies

dcdd80a

isaac-chung marked this pull request as ready for review January 6, 2025 22:17

KennethEnevoldsen requested changes Jan 7, 2025

Comment thread .github/workflows/model_loading.yml Outdated

Comment thread scripts/model_load_failures.json

Comment thread .github/workflows/model_loading.yml

Comment thread .github/workflows/model_loading.yml Outdated

Comment thread .github/workflows/model_loading.yml Outdated

isaac-chung added 2 commits January 7, 2025 09:54

dependecy installations and move file location

64d9c83

should trigger a model load test in CI

0eef873

isaac-chung added 3 commits January 7, 2025 10:49

try to run in python instead and add pytest

6fbaf0f

fix attribute error and add read mode

8830034

separate script calling

b1c2021

isaac-chung added 2 commits January 7, 2025 15:35

Merge branch 'main' of https://github.com/embeddings-benchmark/mteb i…

fc89ce0

…nto add-model-load-test-below-n_param_threshold

let pip install be cached and specify repo path

d843138

isaac-chung added 3 commits January 7, 2025 16:26

check ancestry

f994ab1

add cache and rebase

95d804d

try to merge instead of rebase

a85a2cd

Samoed reviewed Jan 7, 2025

Comment thread scripts/extract_model_names.py Outdated

isaac-chung added 2 commits January 8, 2025 13:52

try without merge base

609c883

check if file exists first

44ccf08

isaac-chung requested review from KennethEnevoldsen and Samoed January 8, 2025 14:26

KennethEnevoldsen reviewed Jan 8, 2025

isaac-chung and others added 2 commits January 8, 2025 17:28

Apply suggestions from code review

d479c5f

Co-authored-by: Kenneth Enevoldsen <kennethcenevoldsen@gmail.com>

Update .github/workflows/model_loading.yml

fb26eab

Co-authored-by: Kenneth Enevoldsen <kennethcenevoldsen@gmail.com>

This was referenced Jan 8, 2025

Problem loading sentence-transformer/multi-qa-MiniLM-L6-cos-v1 #1726

Closed

How should we handle gated models when testing #1727

Closed

isaac-chung changed the title ~~misc: Add script to test model loading below n_parameters threshold~~ test: Add script to test model loading below n_parameters threshold Jan 8, 2025

Samoed reviewed Jan 8, 2025

Comment thread .github/workflows/model_loading.yml

KennethEnevoldsen approved these changes Jan 8, 2025

Comment thread .github/workflows/model_loading.yml

isaac-chung added 2 commits January 9, 2025 15:15

Merge branch 'main' into add-model-load-test-below-n_param_threshold

3dcaa96

address review comments to run test once from CI and not pytest

a9ffc88

isaac-chung merged commit 8d033f3 into main Jan 9, 2025

isaac-chung deleted the add-model-load-test-below-n_param_threshold branch January 9, 2025 15:42


3 participants
