Add vultr/VultronRetrieverCore-Qwen3.5-4.5B ViDoRe results by athrael-soju · Pull Request #579 · embeddings-benchmark/results

athrael-soju · 2026-06-22T17:31:15Z

Adds the 22 per-task ViDoRe V1/V2/V3 result JSONs (+ model_meta.json) for vultr/VultronRetrieverCore-Qwen3.5-4.5B at revision 5b63301ce5a49993f9ec1cf36645840b8cbd8120, the mid-tier of the VultronRetriever family.

Means: ViDoRe V1 ndcg@5 0.9221 / V2 ndcg@5 0.6612 / V3 ndcg@10 0.6372.

Depends on the ModelMeta PR embeddings-benchmark/mteb#4850 merging first, so that the model id resolves via get_model_meta. model_meta.json is generated from that entry's ModelMeta.to_dict() at this SHA. The result JSONs are copied verbatim from the model repo's eval_results/ directory.
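For readers reproducing the model_meta.json step, here is a minimal sketch of how such a file could be written. It is not the script used for this PR. The helper name `write_model_meta` and the `<org>__<model>/<revision>/` directory layout are assumptions for illustration; the only thing taken from the description above is that the file content comes from `ModelMeta.to_dict()`.

```python
import json
from pathlib import Path


def write_model_meta(meta, results_root):
    """Write model_meta.json for a ModelMeta-like object.

    `meta` is any object exposing `.name`, `.revision` and `.to_dict()`.
    The <org>__<model>/<revision>/ layout is an assumption of this sketch.
    """
    out_dir = Path(results_root) / meta.name.replace("/", "__") / meta.revision
    out_dir.mkdir(parents=True, exist_ok=True)
    path = out_dir / "model_meta.json"
    # default=str is a defensive fallback for values json cannot serialize
    # natively (for example dates); it is not something the PR specifies.
    path.write_text(json.dumps(meta.to_dict(), indent=2, default=str))
    return path


# Hypothetical usage once embeddings-benchmark/mteb#4850 is merged:
#   import mteb
#   meta = mteb.get_model_meta(
#       "vultr/VultronRetrieverCore-Qwen3.5-4.5B",
#       revision="5b63301ce5a49993f9ec1cf36645840b8cbd8120",
#   )
#   write_model_meta(meta, "results")
```

Taking the meta object as a parameter keeps the helper independent of whether the model id already resolves in the installed mteb version.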

github-actions · 2026-06-22T18:58:58Z

Model Results Comparison

Reference models: intfloat/multilingual-e5-large, google/gemini-embedding-001
New models evaluated: vultr/VultronRetrieverCore-Qwen3.5-4.5B

Results for `vultr/VultronRetrieverCore-Qwen3.5-4.5B`

task_name	vultr/VultronRetrieverCore-Qwen3.5-4.5B	Max result	Model with max result	In Training Data
Vidore2BioMedicalLecturesRetrieval	.665	.670	DataScience-UIBK/Argus-Colqwen3.5-9b-v0-bf16	False
Vidore2ESGReportsHLRetrieval	.729	.791	DataScience-UIBK/Argus-Colqwen3.5-9b-v0-bf16	False
Vidore2ESGReportsRetrieval	.639	.660	OpenSearch-AI/Ops-Colqwen3-4B	False
Vidore2EconomicsReportsRetrieval	.612	.658	DataScience-UIBK/Argus-Colqwen3.5-9b-v0	False
Vidore3ComputerScienceRetrieval	.798	.809	webAI-Official/webAI-ColVec1-9b	False
Vidore3EnergyRetrieval	.692	.703	vultr/VultronRetrieverPrime-Qwen3.5-8B	False
Vidore3FinanceEnRetrieval	.689	.690	vultr/VultronRetrieverPrime-Qwen3.5-8B	False
Vidore3FinanceFrRetrieval	.520	.545	vultr/VultronRetrieverPrime-Qwen3.5-8B	False
Vidore3HrRetrieval	.661	.700	webAI-Official/webAI-ColVec1-9b	False
Vidore3IndustrialRetrieval	.561	.574	vultr/VultronRetrieverPrime-Qwen3.5-8B	False
Vidore3PharmaceuticalsRetrieval	.674	.682	vultr/VultronRetrieverPrime-Qwen3.5-8B	False
Vidore3PhysicsRetrieval	.502	.517	vultr/VultronRetrieverPrime-Qwen3.5-8B	False
VidoreArxivQARetrieval	.920	.938	VAGOsolutions/SauerkrautLM-ColQwen3-8b-v0.1	True
VidoreDocVQARetrieval	.673	.687	webAI-Official/webAI-ColVec1-9b	True
VidoreInfoVQARetrieval	.940	.952	webAI-Official/webAI-ColVec1-9b	True
VidoreShiftProjectRetrieval	.938	.947	DataScience-UIBK/Argus-Colqwen3.5-4b-v0	False
VidoreSyntheticDocQAAIRetrieval	.996	1.000	ApsaraStackMaaS/EvoQwen2.5-VL-Retriever-3B-v1	True
VidoreSyntheticDocQAEnergyRetrieval	.983	.980	nvidia/llama-nemotron-colembed-vl-3b-v2	True
VidoreSyntheticDocQAGovernmentReportsRetrieval	.985	.989	DataScience-UIBK/Argus-Colqwen3.5-9b-v0	True
VidoreSyntheticDocQAHealthcareIndustryRetrieval	1.000	1.000	vultr/VultronRetrieverPrime-Qwen3.5-8B	True
VidoreTabfquadRetrieval	.962	.981	nvidia/nemotron-colembed-vl-4b-v2	True
VidoreTatdqaRetrieval	.824	.857	DataScience-UIBK/Argus-Colqwen3.5-9b-v0-bf16	True
Average	.771	.788	nan	-

The model has high performance on these tasks: VidoreSyntheticDocQAEnergyRetrieval
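As a sanity check, the per-benchmark means quoted in the PR description can be recomputed from the rounded scores in the table above. The sketch below assumes that tasks prefixed `Vidore2` and `Vidore3` belong to V2 and V3 and that every other `Vidore` task belongs to V1. It also lists tasks where the new score is strictly above the previous maximum, which is only a guess at what the bot flags. Because the table is rounded to three decimals, agreement is expected only to about the third decimal place.

```python
from statistics import mean

# (task, new-model score, previous max), copied from the comparison table.
ROWS = [
    ("Vidore2BioMedicalLecturesRetrieval", 0.665, 0.670),
    ("Vidore2ESGReportsHLRetrieval", 0.729, 0.791),
    ("Vidore2ESGReportsRetrieval", 0.639, 0.660),
    ("Vidore2EconomicsReportsRetrieval", 0.612, 0.658),
    ("Vidore3ComputerScienceRetrieval", 0.798, 0.809),
    ("Vidore3EnergyRetrieval", 0.692, 0.703),
    ("Vidore3FinanceEnRetrieval", 0.689, 0.690),
    ("Vidore3FinanceFrRetrieval", 0.520, 0.545),
    ("Vidore3HrRetrieval", 0.661, 0.700),
    ("Vidore3IndustrialRetrieval", 0.561, 0.574),
    ("Vidore3PharmaceuticalsRetrieval", 0.674, 0.682),
    ("Vidore3PhysicsRetrieval", 0.502, 0.517),
    ("VidoreArxivQARetrieval", 0.920, 0.938),
    ("VidoreDocVQARetrieval", 0.673, 0.687),
    ("VidoreInfoVQARetrieval", 0.940, 0.952),
    ("VidoreShiftProjectRetrieval", 0.938, 0.947),
    ("VidoreSyntheticDocQAAIRetrieval", 0.996, 1.000),
    ("VidoreSyntheticDocQAEnergyRetrieval", 0.983, 0.980),
    ("VidoreSyntheticDocQAGovernmentReportsRetrieval", 0.985, 0.989),
    ("VidoreSyntheticDocQAHealthcareIndustryRetrieval", 1.000, 1.000),
    ("VidoreTabfquadRetrieval", 0.962, 0.981),
    ("VidoreTatdqaRetrieval", 0.824, 0.857),
]


def benchmark(task):
    """Map a task name to a ViDoRe version by prefix (assumed convention)."""
    if task.startswith("Vidore2"):
        return "V2"
    if task.startswith("Vidore3"):
        return "V3"
    return "V1"


def group_means(rows):
    """Average the new-model score within each ViDoRe version."""
    groups = {}
    for task, score, _ in rows:
        groups.setdefault(benchmark(task), []).append(score)
    return {name: mean(scores) for name, scores in groups.items()}


def above_previous_max(rows):
    """Tasks where the new score is strictly higher than the previous max."""
    return [task for task, score, best in rows if score > best]


means = group_means(ROWS)
overall = mean(score for _, score, _ in ROWS)
print({name: round(value, 4) for name, value in means.items()}, round(overall, 3))
print(above_previous_max(ROWS))
```

Small differences from the quoted means in the last decimal place are rounding effects from the three-decimal table, not discrepancies in the result files.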

athrael-soju · 2026-06-22T19:35:25Z

@Samoed ready to roll

Add vultr/VultronRetrieverCore-Qwen3.5-4.5B ViDoRe results

1d0a8ba

athrael-soju mentioned this pull request Jun 22, 2026

Add vultr/VultronRetrieverCore-Qwen3.5-4.5B embeddings-benchmark/mteb#4850

Merged

Samoed approved these changes Jun 22, 2026

Samoed merged commit 3c50ebb into embeddings-benchmark:main Jun 22, 2026
3 of 5 checks passed

Samoed merged 1 commit into embeddings-benchmark:main from athrael-soju:add-vultron-core-4-5b-results
