feat: evaluate arctic v2 models in MTEB(Medical) by dbuades · Pull Request #66 · embeddings-benchmark/results

dbuades · 2024-12-09T20:09:23Z

This PR evaluates the arctic-embed-v2.0 family of models by Snowflake on the MTEB(Medical) benchmark.

Checklist

Run tests locally to make sure nothing is broken using make test.
Run the results files checker make pre-push.

Adding a model checklist

Models are added to mteb/models in embeddings-benchmark/mteb#1574.

KennethEnevoldsen · 2024-12-10T19:38:49Z

+    ]
+  },
+  "evaluation_time": 33.124486207962036,
+  "kg_co2_emissions": null


I am a bit sad not to have the co2 usage, but otherwise, this looks fine

Is there any reason why codecarbon is currently not enabled by default? I'll modify my script to enable it from now on for new runs.

Actually, I'll just run it again with it enabled, it doesn't take long.

KennethEnevoldsen · 2024-12-10T19:39:43Z

@@ -0,0 +1 @@
+{"name": "Snowflake/snowflake-arctic-embed-m-v2.0", "revision": "f2a7d59d80dfda5b1d14f096f3ce88bb6bf9ebdc", "release_date": null, "languages": null, "n_parameters": null, "memory_usage": null, "max_tokens": null, "embed_dim": null, "license": "apache-2.0", "open_weights": null, "public_training_data": false, "public_training_code": null, "framework": ["PyTorch"], "reference": null, "similarity_fn_name": null, "use_instructions": null, "training_datasets": null, "adapted_from": null, "superseded_by": null, "loader": null}


a lot of the metadata seems to be missing here - is it not run using the implementation in the package? (loader: null)

It is using the package but without embeddings-benchmark/mteb#1574. It still used the sentence_transformers_loader. I let these models ran while I was making the other PR and then once the PR was finished I just ran the models again on a single task to validate that the previous results stayed the same. I'll upload the new metadata.

dbuades · 2024-12-10T23:24:58Z

All good @KennethEnevoldsen !

KennethEnevoldsen

Close - only a minor thing regarding use of instructions

feat: evaluate arctic v2 models in MTEB(Medical)

b4a70a0

dbuades mentioned this pull request Dec 9, 2024

feat: add new arctic v2.0 models embeddings-benchmark/mteb#1574

Merged

7 tasks

KennethEnevoldsen reviewed Dec 10, 2024

View reviewed changes

dbuades added 3 commits December 10, 2024 21:42

Merge branch 'main' into feat/medical-mteb-arctic-v2

e6a73a9

feat: track co2 emissions

a817b37

feat: new model_meta.json

1fa6b88

KennethEnevoldsen approved these changes Dec 11, 2024

View reviewed changes

Comment thread ...lake__snowflake-arctic-embed-l-v2.0/edc2df7b6c25794b340229ca082e7c78782e6374/model_meta.json Outdated

fix: set use_instructions to true

5c4432b

KennethEnevoldsen enabled auto-merge (squash) December 11, 2024 23:10

KennethEnevoldsen disabled auto-merge December 11, 2024 23:11

KennethEnevoldsen merged commit edb303f into embeddings-benchmark:main Dec 11, 2024

dbuades deleted the feat/medical-mteb-arctic-v2 branch December 13, 2024 13:18

dbuades mentioned this pull request Dec 20, 2024

fix: set use_instructions to True in models using prompts embeddings-benchmark/mteb#1616

Merged

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: evaluate arctic v2 models in MTEB(Medical)#66

feat: evaluate arctic v2 models in MTEB(Medical)#66
KennethEnevoldsen merged 5 commits into
embeddings-benchmark:mainfrom
clinia:feat/medical-mteb-arctic-v2

dbuades commented Dec 9, 2024

Uh oh!

KennethEnevoldsen Dec 10, 2024

Uh oh!

dbuades Dec 10, 2024 •

edited

Loading

Uh oh!

dbuades Dec 10, 2024

Uh oh!

KennethEnevoldsen Dec 10, 2024

Uh oh!

dbuades Dec 10, 2024

Uh oh!

dbuades commented Dec 10, 2024

Uh oh!

KennethEnevoldsen left a comment •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		@@ -0,0 +1 @@
		{"name": "Snowflake/snowflake-arctic-embed-m-v2.0", "revision": "f2a7d59d80dfda5b1d14f096f3ce88bb6bf9ebdc", "release_date": null, "languages": null, "n_parameters": null, "memory_usage": null, "max_tokens": null, "embed_dim": null, "license": "apache-2.0", "open_weights": null, "public_training_data": false, "public_training_code": null, "framework": ["PyTorch"], "reference": null, "similarity_fn_name": null, "use_instructions": null, "training_datasets": null, "adapted_from": null, "superseded_by": null, "loader": null} No newline at end of file

Uh oh!

Conversation

dbuades commented Dec 9, 2024

Checklist

Adding a model checklist

Uh oh!

KennethEnevoldsen Dec 10, 2024

Choose a reason for hiding this comment

Uh oh!

dbuades Dec 10, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

dbuades Dec 10, 2024

Choose a reason for hiding this comment

Uh oh!

KennethEnevoldsen Dec 10, 2024

Choose a reason for hiding this comment

Uh oh!

dbuades Dec 10, 2024

Choose a reason for hiding this comment

Uh oh!

dbuades commented Dec 10, 2024

Uh oh!

KennethEnevoldsen left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

dbuades Dec 10, 2024 •

edited

Loading

KennethEnevoldsen left a comment •

edited

Loading