Rename VultronRetrieverPrime-Qwen3.5-8B results to the vultr org by athrael-soju · Pull Request #576 · embeddings-benchmark/results

athrael-soju · 2026-06-19T15:19:26Z

Follow-up to #575. The model moved on the Hub from athrael-soju/VultronRetrieverPrime-Qwen3.5-8B to vultr/VultronRetrieverPrime-Qwen3.5-8B. This renames the results directory results/athrael-soju__VultronRetrieverPrime-Qwen3.5-8B/ -> results/vultr__VultronRetrieverPrime-Qwen3.5-8B/ and updates name + reference in model_meta.json to match.

Revision e8f3104b743a04b0d5f715b67117d687ae99ce51 is preserved across the transfer (it's the <name>/<revision>/ subdir key), so the 22 task result JSONs move unchanged and no scores change.
Pairs with the ModelMeta rename model: update VultronRetrieverPrime-Qwen3.5-8B repo path to the vultr org mteb#4836.

Merge order: please merge after mteb#4836 lands — the results name vultr/VultronRetrieverPrime-Qwen3.5-8B resolves via get_model_meta against the renamed ModelMeta, so this should go second (same ModelMeta-first order as the original submission).

The model moved on the Hub from athrael-soju/ to vultr/. Rename the results directory athrael-soju__VultronRetrieverPrime-Qwen3.5-8B -> vultr__VultronRetrieverPrime-Qwen3.5-8B (revision e8f3104b743a04b0d5f715b67117d687ae99ce51 preserved across the transfer) and update name + reference in model_meta.json. The 22 task result JSONs are unchanged (no org string in them); no scores change. Companion to the ModelMeta rename embeddings-benchmark/mteb#4836; merges after it. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

github-actions · 2026-06-19T15:36:57Z

Model Results Comparison

Reference models: intfloat/multilingual-e5-large, google/gemini-embedding-001
New models evaluated: vultr/VultronRetrieverPrime-Qwen3.5-8B

Results for `vultr/VultronRetrieverPrime-Qwen3.5-8B`

task_name	vultr/VultronRetrieverPrime-Qwen3.5-8B	Max result	Model with max result	In Training Data
Vidore2BioMedicalLecturesRetrieval	.663	.670	DataScience-UIBK/Argus-Colqwen3.5-9b-v0-bf16	False
Vidore2ESGReportsHLRetrieval	.768	.791	DataScience-UIBK/Argus-Colqwen3.5-9b-v0-bf16	False
Vidore2ESGReportsRetrieval	.658	.660	OpenSearch-AI/Ops-Colqwen3-4B	False
Vidore2EconomicsReportsRetrieval	.638	.658	DataScience-UIBK/Argus-Colqwen3.5-9b-v0	False
Vidore3ComputerScienceRetrieval	.798	.809	webAI-Official/webAI-ColVec1-9b	False
Vidore3EnergyRetrieval	.703	.698	nvidia/nemotron-colembed-vl-8b-v2	False
Vidore3FinanceEnRetrieval	.690	.685	webAI-Official/webAI-ColVec1-4b	False
Vidore3FinanceFrRetrieval	.545	.537	webAI-Official/webAI-ColVec1-9b	False
Vidore3HrRetrieval	.668	.700	webAI-Official/webAI-ColVec1-9b	False
Vidore3IndustrialRetrieval	.574	.572	webAI-Official/webAI-ColVec1-9b	False
Vidore3PharmaceuticalsRetrieval	.682	.673	webAI-Official/webAI-ColVec1-9b	False
Vidore3PhysicsRetrieval	.517	.508	nvidia/nemotron-colembed-vl-8b-v2	False
VidoreArxivQARetrieval	.932	.938	VAGOsolutions/SauerkrautLM-ColQwen3-8b-v0.1	True
VidoreDocVQARetrieval	.677	.687	webAI-Official/webAI-ColVec1-9b	True
VidoreInfoVQARetrieval	.941	.952	webAI-Official/webAI-ColVec1-9b	True
VidoreShiftProjectRetrieval	.946	.947	DataScience-UIBK/Argus-Colqwen3.5-4b-v0	False
VidoreSyntheticDocQAAIRetrieval	.993	1.000	ApsaraStackMaaS/EvoQwen2.5-VL-Retriever-3B-v1	True
VidoreSyntheticDocQAEnergyRetrieval	.968	.980	nvidia/llama-nemotron-colembed-vl-3b-v2	True
VidoreSyntheticDocQAGovernmentReportsRetrieval	.973	.989	nvidia/nemotron-colembed-vl-8b-v2	True
VidoreSyntheticDocQAHealthcareIndustryRetrieval	1.000	1.000	VAGOsolutions/SauerkrautLM-ColQwen3-4b-v0.1	True
VidoreTabfquadRetrieval	.964	.981	nvidia/nemotron-colembed-vl-4b-v2	True
VidoreTatdqaRetrieval	.815	.857	DataScience-UIBK/Argus-Colqwen3.5-9b-v0-bf16	True
Average	.778	.786	nan	-

Model have high performance on these tasks: Vidore3EnergyRetrieval,Vidore3FinanceEnRetrieval,Vidore3PharmaceuticalsRetrieval,Vidore3IndustrialRetrieval,Vidore3FinanceFrRetrieval,Vidore3PhysicsRetrieval

athrael-soju · 2026-06-20T11:03:24Z

@Samoed has the leaderboard been rebuilt? I can't see the model on the leaderboard on V1, V2, or V3?

Samoed · 2026-06-20T11:06:13Z

Not yet. Some parts of new leaderboard are still under review and because of that there is no automatic CI for now. I'll update leaderboard soon

Samoed · 2026-06-20T12:18:20Z

@athrael-soju Your model is available on leaderboard https://mteb-leaderboard.hf.space/models/vultr/VultronRetrieverPrime-Qwen3.5-8B

Samoed approved these changes Jun 19, 2026

View reviewed changes

Samoed enabled auto-merge (squash) June 19, 2026 15:34

Samoed merged commit 9b269a7 into embeddings-benchmark:main Jun 19, 2026
3 of 5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Rename VultronRetrieverPrime-Qwen3.5-8B results to the vultr org#576

Rename VultronRetrieverPrime-Qwen3.5-8B results to the vultr org#576
Samoed merged 1 commit into
embeddings-benchmark:mainfrom
athrael-soju:move-vultronprime-to-vultr-org-results

athrael-soju commented Jun 19, 2026

Uh oh!

github-actions Bot commented Jun 19, 2026

Uh oh!

Uh oh!

athrael-soju commented Jun 20, 2026

Uh oh!

Samoed commented Jun 20, 2026

Uh oh!

Samoed commented Jun 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

athrael-soju commented Jun 19, 2026

Uh oh!

github-actions Bot commented Jun 19, 2026

Model Results Comparison

Results for vultr/VultronRetrieverPrime-Qwen3.5-8B

Uh oh!

Uh oh!

athrael-soju commented Jun 20, 2026

Uh oh!

Samoed commented Jun 20, 2026

Uh oh!

Samoed commented Jun 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Results for `vultr/VultronRetrieverPrime-Qwen3.5-8B`