model: llama-embed-nemotron-8b by ybabakhin · Pull Request #3407 · embeddings-benchmark/mteb

ybabakhin · 2025-10-17T21:22:51Z

I have filled out the ModelMeta object to the extent possible
I have ensured that my model can be loaded using
- mteb.get_model(model_name, revision) and
- mteb.get_model_meta(model_name, revision)
I have tested the implementation works on a representative set of tasks.
The model is public, i.e., it is available either as an API or the weights are publicly available to download

Samoed · 2025-10-17T22:37:19Z

Do you have plans to integrate your omni embed model? We're releasing v2 on Monday with better support for multimodality

Samoed · 2025-10-17T23:18:09Z

+        model_name,
+        revision,
+        max_seq_length=4096,
+        batch_size=4,


Suggested change

batch_size=4,

I added batch_size handling from encode_kwargs, but some of the benchmarks are getting GPU OOM now. Is it a user's responsibility to specify a proper encode_kwargs={"batch_size": 4} argument?

Well, it really depends on the system they are on. Currently, the default is 32, but it might be worth lowering. I'm unsure whether it is better to hit the OOM and adjust the batch size down to a reasonable level than to ship a default that is too low. I'm leaning toward the OOM being better

Right, I think 32 is a reasonable default. Actually, I was getting those OOMs on version 1.39.7, which had a default of 128 for some problem types
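For anyone hitting the same OOMs, the "get the OOM and adjust down" approach from this thread can also be automated on the user side. Below is a minimal sketch, not part of mteb: `encode_with_backoff` and the `encode_fn(texts, batch_size)` callable are hypothetical names, and it assumes the OOM surfaces as a `RuntimeError` whose message contains "out of memory", as PyTorch CUDA OOM errors typically do.

```python
from typing import Callable, Sequence, TypeVar

T = TypeVar("T")


def encode_with_backoff(
    encode_fn: Callable[[Sequence[str], int], T],  # hypothetical encode callable
    texts: Sequence[str],
    batch_size: int = 32,
    min_batch_size: int = 1,
) -> T:
    """Call encode_fn, halving batch_size each time an OOM-like error is raised."""
    while True:
        try:
            return encode_fn(texts, batch_size)
        except RuntimeError as err:
            # Assumption: OOM errors mention "out of memory" in their message.
            is_oom = "out of memory" in str(err).lower()
            if not is_oom or batch_size <= min_batch_size:
                raise  # not an OOM, or nothing left to reduce
            batch_size = max(min_batch_size, batch_size // 2)
```

Starting from the default of 32, this would try 32, 16, 8, and so on until the call succeeds or the minimum is reached. Passing `encode_kwargs={"batch_size": 4}` explicitly remains the simpler option.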

Samoed · 2025-10-17T23:19:07Z

+            with torch.inference_mode():
+                inputs = self.tokenizer(
+                    batch, 
+                    max_length=self.max_seq_length, 


I think it would be better to specify this in the tokenization config

Here is the current state of our context length:

Base Llama-3.1-8B supports 128k

We've tested our llama-embed-nemotron-8b with the context length up to 32k, which we report in the metadata

We've run the evaluation with a 4k context length

So, our config has a theoretical 128k limit, but 4k is here for eval reproducibility

boliu61 · 2025-10-17T23:29:40Z

Do you have plans to integrate your omni embed model? We're releasing v2 on Monday with better support for multimodality

Hi @Samoed, do you mean v2 of M-MTEB? What will happen to the current M-MTEB leaderboard on Monday?

By integrating, you mean combining llama-embed-nemotron-8b and omni-embed-nemotron-3b? We don't have this plan as of now

Samoed · 2025-10-18T07:03:47Z

do you mean v2 of M-MTEB?

I mean this library

What will happen to the current M-MTEB leaderboard on Monday?

Nothing, it will be unchanged

By integrating, you mean combining llama-embed-nemotron-8b and omni-embed-nemotron-3b

No, add it as a separate model, because omni is multimodal, but nemotron is text-only

ybabakhin · 2025-10-18T08:09:21Z

I've found the change log here: https://embeddings-benchmark.github.io/mteb/whats_new/, looks nice!

@Samoed Which existing/upcoming Leaderboards would you suggest for the Omni model?

Samoed · 2025-10-18T08:25:15Z

I created an issue for the omni model, #3411, where we can continue the discussion. I will update this PR a bit to align with v2

ybabakhin · 2025-10-18T08:30:25Z

I will update this PR a bit to align with v2

Thanks! I added a few changes + lint

@Samoed Do you have an ETA for when this model can make it to the MMTEB Leaderboard? Do we have to wait for the 2.0.0 release, or can it be published earlier?

Samoed · 2025-10-18T08:35:46Z

If you can wait a bit, it’ll be easier to add the model to v2, and it will appear on the leaderboard on Monday with the release of the second version.

Samoed · 2025-10-18T09:15:41Z

@ybabakhin I've aligned your model with v2 version and added your name to contacts as part of #3399. Can you try to run this implementation?

ybabakhin · 2025-10-18T12:40:53Z

@Samoed I added a small fix to make it work with v2.0.0. batch_size is still being passed in kwargs, even though DataLoader is provided explicitly now.

New eval code works fine:

import mteb

model_name = "nvidia/llama-embed-nemotron-8b"

model = mteb.get_model(model_name)
tasks = mteb.get_tasks(tasks=["HagridRetrieval"])

mteb.evaluate(
    model,
    tasks,
    encode_kwargs={"batch_size": 4},
)

I'm only getting TOKENIZERS_PARALLELISM warnings:

huggingface/tokenizers: The current process just got forked, after parallelism has already been used. Disabling parallelism to avoid deadlocks...
To disable this warning, you can either:
        - Avoid using `tokenizers` before the fork if possible
        - Explicitly set the environment variable TOKENIZERS_PARALLELISM=(true | false)
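The warning names its own remedy: set TOKENIZERS_PARALLELISM explicitly. A minimal sketch, assuming it goes at the top of the eval script before any tokenizer is used. Choosing "false" here is an assumption; "true" is the other accepted value.

```python
import os

# Set this before the tokenizer is first used, so worker processes forked
# later (e.g. by a DataLoader) inherit the value and the warning is not emitted.
os.environ["TOKENIZERS_PARALLELISM"] = "false"
```

Exporting the variable in the shell before launching the script would have the same effect.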

I will run it on more tasks to check if there are any discrepancies. Shall I also update a model_meta.json to a new 2.0.0 format in embeddings-benchmark/results#302?

Samoed · 2025-10-18T12:49:51Z

batch_size is still being passed in kwargs, even though DataLoader is provided explicitly now.

Yes, this is for models that wouldn't use dataloaders directly (e.g. sentence transformers).
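To illustrate the point for custom wrappers: a model that iterates over pre-batched inputs can simply discard the extra `batch_size` keyword. The sketch below is hypothetical and is not the mteb interface; `SketchEncoder`, the `{"text": [...]}` batch layout, and the length-based "embedding" are stand-ins chosen so the example runs without a model.

```python
from typing import Any, Iterable


class SketchEncoder:
    """Hypothetical wrapper that consumes already-batched inputs."""

    def encode(
        self, inputs: Iterable[dict[str, list[str]]], **kwargs: Any
    ) -> list[list[float]]:
        # The loader has already done the batching, so a batch_size passed
        # through kwargs is dropped instead of being forwarded to the model.
        kwargs.pop("batch_size", None)
        embeddings: list[list[float]] = []
        for batch in inputs:
            for text in batch["text"]:
                # Stand-in for a real forward pass: one feature per text.
                embeddings.append([float(len(text))])
        return embeddings
```

A call such as `SketchEncoder().encode(batches, batch_size=4)` then works whether or not the caller passes `batch_size`.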

Shall I also update a model_meta.json to a new 2.0.0 format

That would be nice, but this is minor

ybabakhin · 2025-10-18T12:57:22Z

@Samoed some tests are failing, but I don't think it is related to the changes in this PR

Samoed · 2025-10-18T13:00:17Z

Yes, I see. This is a flaky test that we’re currently working to fix.

ybabakhin · 2025-10-19T11:53:50Z

@Samoed, @KennethEnevoldsen could you please merge this PR now? Also, is the v2.0.0 release still planned for tomorrow?

Samoed · 2025-10-19T12:18:58Z

Yes, it will be released tomorrow. This PR will be merged, and Kenneth will finish reviewing the results

add llama-embed-nemotron-8b

1336f56

ybabakhin mentioned this pull request Oct 17, 2025

MMTEB results for llama-embed-nemotron-8b embeddings-benchmark/results#302

Merged

6 tasks

Samoed reviewed Oct 17, 2025

Samoed mentioned this pull request Oct 18, 2025

Add model: nvidia/omni-embed-nemotron-3b #3411

Closed

take batch size from encode_kwargs + lint

5f7c8d7

Samoed changed the base branch from main to v2.0.0 October 18, 2025 09:00

Samoed added 2 commits October 18, 2025 12:02

Merge branch 'v2.0.0' into llama-embed-nemotron-8b

917aac6

# Conflicts: # mteb/models/nvidia_models.py

align with v2

1563b2f

Samoed requested a review from KennethEnevoldsen October 18, 2025 09:16

Samoed added the new model Questions related to adding a new model to the benchmark label Oct 18, 2025

fix for v2.0.0

ef5e26d

KennethEnevoldsen approved these changes Oct 19, 2025

Comment thread mteb/models/model_implementations/nvidia_models.py

Merge branch 'v2.0.0' into llama-embed-nemotron-8b

6f1259d

ybabakhin mentioned this pull request Oct 19, 2025

Add training artifacts for llama-embed-nemotron-8b model #3428

Closed

Samoed merged commit d1ce6aa into embeddings-benchmark:v2.0.0 Oct 19, 2025
11 checks passed

KennethEnevoldsen mentioned this pull request Oct 20, 2025

Switch CI to UV #3438

Closed

4 participants
