Adding results italian datasets PR#234

Merged
isaac-chung merged 2 commits into embeddings-benchmark:main from MattiaSangermano:main on Jul 7, 2025

MattiaSangermano commented Jul 6, 2025

Contributor

Here I have included the results from mteb PR 2859. I added results for the models intfloat__multilingual-e5-small and sentence-transformers__paraphrase-multilingual-MiniLM-L12-v2 on DadoEvalCoarseClassification, DisCoTexPairClassification, EmitClassification, SardiStanceClassification, and XGlueWPRReranking.

Checklist

  • My model has a model sheet, report or similar
  • My model has a reference implementation in mteb/models/ (this can be an API). Instructions on how to add a model can be found here
    • No, but there is an existing PR ___
  • The results submitted were obtained using the reference implementation
  • My model is available, either as a publicly accessible API or publicly on, e.g., Hugging Face
  • I solemnly swear that for all results submitted I have not trained on the evaluation dataset, including training splits. If I have, I have disclosed it clearly.

…-transformers__paraphrase-multilingual-MiniLM-L12-v2 on DadoEvalCoarseClassification, DisCoTexPairClassification, EmitClassification, SardiStanceClassification and XGlueWPRReranking
…-transformers__paraphrase-multilingual-MiniLM-L12-v2 on MKQARetrieval task

isaac-chung left a comment

Contributor

Thanks!
@Samoed any idea if this error already has a fix planned/in place?

Samoed commented Jul 7, 2025

Member

The error occurs because I'm using MTEB from the main branch, but this PR was created before the tasks were merged. I don't have any ideas on how to fix it (without manually triggering the CI with a different branch), except to rerun it manually after the tasks are merged.

github-actions Bot commented Jul 7, 2025

Model Results Comparison

Reference models: intfloat/multilingual-e5-large, google/gemini-embedding-001
New models evaluated: intfloat/multilingual-e5-small, sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2
Tasks: DadoEvalCoarseClassification, DisCoTexPairClassification, EmitClassification, MKQARetrieval, SardiStanceClassification, XGlueWPRReranking

Results for intfloat/multilingual-e5-small

| task_name | intfloat/multilingual-e5-small | Max result |
|---|---|---|
| DadoEvalCoarseClassification | 0.51 | 0.51 |
| DisCoTexPairClassification | 0.69 | 0.69 |
| EmitClassification | 0.04 | 0.04 |
| MKQARetrieval | 0.06 | 0.06 |
| SardiStanceClassification | 0.42 | 0.42 |
| XGlueWPRReranking | 0.70 | 0.70 |
| Average | 0.40 | 0.40 |

Results for sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2

| task_name | sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2 | Max result |
|---|---|---|
| DadoEvalCoarseClassification | 0.38 | 0.51 |
| DisCoTexPairClassification | 0.67 | 0.69 |
| EmitClassification | 0.03 | 0.04 |
| MKQARetrieval | 0.05 | 0.06 |
| SardiStanceClassification | 0.40 | 0.42 |
| XGlueWPRReranking | 0.68 | 0.70 |
| Average | 0.37 | 0.40 |
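
For readers who want to check the "Average" and "Max result" figures, the sketch below recomputes them from the rounded per-task scores reported above. It is not the bot's actual implementation, which is not shown in this conversation; the names `scores`, `average`, and `max_per_task` are hypothetical, and the sketch assumes the max is taken only over the two newly evaluated models.

```python
# Rounded per-task scores copied from the bot comment above.
# Assumption: only the two newly evaluated models contribute to "Max result".
scores = {
    "intfloat/multilingual-e5-small": {
        "DadoEvalCoarseClassification": 0.51,
        "DisCoTexPairClassification": 0.69,
        "EmitClassification": 0.04,
        "MKQARetrieval": 0.06,
        "SardiStanceClassification": 0.42,
        "XGlueWPRReranking": 0.70,
    },
    "sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2": {
        "DadoEvalCoarseClassification": 0.38,
        "DisCoTexPairClassification": 0.67,
        "EmitClassification": 0.03,
        "MKQARetrieval": 0.05,
        "SardiStanceClassification": 0.40,
        "XGlueWPRReranking": 0.68,
    },
}


def average(task_scores):
    """Unweighted mean over tasks, rounded to two decimals."""
    return round(sum(task_scores.values()) / len(task_scores), 2)


def max_per_task(all_scores):
    """Best score per task across all models in all_scores."""
    tasks = next(iter(all_scores.values())).keys()
    return {task: max(model[task] for model in all_scores.values()) for task in tasks}


averages = {model: average(task_scores) for model, task_scores in scores.items()}
max_results = max_per_task(scores)

for model, avg in averages.items():
    print(f"{model}: average = {avg:.2f}")
print(f"average of per-task max = {average(max_results):.2f}")
```

Because the inputs are already rounded, the recomputed averages can only be expected to agree with the reported ones to two decimals.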

@isaac-chung

Contributor

> The error occurs because I'm using MTEB from the main branch, but this PR was created before the tasks were merged. I don't have any ideas on how to fix it (without manually triggering the CI with a different branch), except to rerun it manually after the tasks are merged.

I see, thanks. We might want to look into that separately. Let's create an issue for that.

@isaac-chung isaac-chung merged commit 0353d7b into embeddings-benchmark:main Jul 7, 2025
3 of 4 checks passed

3 participants