Skip to content

Rename VultronRetrieverPrime-Qwen3.5-8B results to the vultr org#576

Merged
Samoed merged 1 commit into
embeddings-benchmark:mainfrom
athrael-soju:move-vultronprime-to-vultr-org-results
Jun 19, 2026
Merged

Rename VultronRetrieverPrime-Qwen3.5-8B results to the vultr org#576
Samoed merged 1 commit into
embeddings-benchmark:mainfrom
athrael-soju:move-vultronprime-to-vultr-org-results

Conversation

@athrael-soju

Copy link
Copy Markdown
Contributor

Follow-up to #575. The model moved on the Hub from athrael-soju/VultronRetrieverPrime-Qwen3.5-8B to vultr/VultronRetrieverPrime-Qwen3.5-8B. This renames the results directory results/athrael-soju__VultronRetrieverPrime-Qwen3.5-8B/ -> results/vultr__VultronRetrieverPrime-Qwen3.5-8B/ and updates name + reference in model_meta.json to match.

Merge order: please merge after mteb#4836 lands — the results name vultr/VultronRetrieverPrime-Qwen3.5-8B resolves via get_model_meta against the renamed ModelMeta, so this should go second (same ModelMeta-first order as the original submission).

The model moved on the Hub from athrael-soju/ to vultr/. Rename the results
directory athrael-soju__VultronRetrieverPrime-Qwen3.5-8B ->
vultr__VultronRetrieverPrime-Qwen3.5-8B (revision
e8f3104b743a04b0d5f715b67117d687ae99ce51 preserved across the transfer) and
update name + reference in model_meta.json. The 22 task result JSONs are
unchanged (no org string in them); no scores change.

Companion to the ModelMeta rename embeddings-benchmark/mteb#4836; merges after it.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
@Samoed Samoed enabled auto-merge (squash) June 19, 2026 15:34
@github-actions

Copy link
Copy Markdown

Model Results Comparison

Reference models: intfloat/multilingual-e5-large, google/gemini-embedding-001
New models evaluated: vultr/VultronRetrieverPrime-Qwen3.5-8B

Results for vultr/VultronRetrieverPrime-Qwen3.5-8B

task_name vultr/VultronRetrieverPrime-Qwen3.5-8B Max result Model with max result In Training Data
Vidore2BioMedicalLecturesRetrieval .663 .670 DataScience-UIBK/Argus-Colqwen3.5-9b-v0-bf16 False
Vidore2ESGReportsHLRetrieval .768 .791 DataScience-UIBK/Argus-Colqwen3.5-9b-v0-bf16 False
Vidore2ESGReportsRetrieval .658 .660 OpenSearch-AI/Ops-Colqwen3-4B False
Vidore2EconomicsReportsRetrieval .638 .658 DataScience-UIBK/Argus-Colqwen3.5-9b-v0 False
Vidore3ComputerScienceRetrieval .798 .809 webAI-Official/webAI-ColVec1-9b False
Vidore3EnergyRetrieval .703 .698 nvidia/nemotron-colembed-vl-8b-v2 False
Vidore3FinanceEnRetrieval .690 .685 webAI-Official/webAI-ColVec1-4b False
Vidore3FinanceFrRetrieval .545 .537 webAI-Official/webAI-ColVec1-9b False
Vidore3HrRetrieval .668 .700 webAI-Official/webAI-ColVec1-9b False
Vidore3IndustrialRetrieval .574 .572 webAI-Official/webAI-ColVec1-9b False
Vidore3PharmaceuticalsRetrieval .682 .673 webAI-Official/webAI-ColVec1-9b False
Vidore3PhysicsRetrieval .517 .508 nvidia/nemotron-colembed-vl-8b-v2 False
VidoreArxivQARetrieval .932 .938 VAGOsolutions/SauerkrautLM-ColQwen3-8b-v0.1 True
VidoreDocVQARetrieval .677 .687 webAI-Official/webAI-ColVec1-9b True
VidoreInfoVQARetrieval .941 .952 webAI-Official/webAI-ColVec1-9b True
VidoreShiftProjectRetrieval .946 .947 DataScience-UIBK/Argus-Colqwen3.5-4b-v0 False
VidoreSyntheticDocQAAIRetrieval .993 1.000 ApsaraStackMaaS/EvoQwen2.5-VL-Retriever-3B-v1 True
VidoreSyntheticDocQAEnergyRetrieval .968 .980 nvidia/llama-nemotron-colembed-vl-3b-v2 True
VidoreSyntheticDocQAGovernmentReportsRetrieval .973 .989 nvidia/nemotron-colembed-vl-8b-v2 True
VidoreSyntheticDocQAHealthcareIndustryRetrieval 1.000 1.000 VAGOsolutions/SauerkrautLM-ColQwen3-4b-v0.1 True
VidoreTabfquadRetrieval .964 .981 nvidia/nemotron-colembed-vl-4b-v2 True
VidoreTatdqaRetrieval .815 .857 DataScience-UIBK/Argus-Colqwen3.5-9b-v0-bf16 True
Average .778 .786 nan -

Model have high performance on these tasks: Vidore3EnergyRetrieval,Vidore3FinanceEnRetrieval,Vidore3PharmaceuticalsRetrieval,Vidore3IndustrialRetrieval,Vidore3FinanceFrRetrieval,Vidore3PhysicsRetrieval


@Samoed Samoed merged commit 9b269a7 into embeddings-benchmark:main Jun 19, 2026
3 of 5 checks passed
@athrael-soju

Copy link
Copy Markdown
Contributor Author

@Samoed has the leaderboard been rebuilt? I can't see the model on the leaderboard on V1, V2, or V3?

@Samoed

Samoed commented Jun 20, 2026

Copy link
Copy Markdown
Member

Not yet. Some parts of new leaderboard are still under review and because of that there is no automatic CI for now. I'll update leaderboard soon

@Samoed

Samoed commented Jun 20, 2026

Copy link
Copy Markdown
Member

@athrael-soju Your model is available on leaderboard https://mteb-leaderboard.hf.space/models/vultr/VultronRetrieverPrime-Qwen3.5-8B

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants