Skip to content

Add vultr/VultronRetrieverCore-Qwen3.5-4.5B ViDoRe results#579

Merged
Samoed merged 1 commit into
embeddings-benchmark:mainfrom
athrael-soju:add-vultron-core-4-5b-results
Jun 22, 2026
Merged

Add vultr/VultronRetrieverCore-Qwen3.5-4.5B ViDoRe results#579
Samoed merged 1 commit into
embeddings-benchmark:mainfrom
athrael-soju:add-vultron-core-4-5b-results

Conversation

@athrael-soju

Copy link
Copy Markdown
Contributor

Adds the 22 per-task ViDoRe V1/V2/V3 result JSONs (+ model_meta.json) for vultr/VultronRetrieverCore-Qwen3.5-4.5B at revision 5b63301ce5a49993f9ec1cf36645840b8cbd8120, the mid-tier of the VultronRetriever family.

Means: ViDoRe V1 ndcg@5 0.9221 / V2 ndcg@5 0.6612 / V3 ndcg@10 0.6372.

Depends on the ModelMeta PR embeddings-benchmark/mteb#4850 merging first, so the model id resolves via get_model_meta. model_meta.json is generated from that entry's ModelMeta.to_dict() at this SHA. JSONs pulled verbatim from the model repo's eval_results/.

@github-actions

Copy link
Copy Markdown

Model Results Comparison

Reference models: intfloat/multilingual-e5-large, google/gemini-embedding-001
New models evaluated: vultr/VultronRetrieverCore-Qwen3.5-4.5B

Results for vultr/VultronRetrieverCore-Qwen3.5-4.5B

task_name vultr/VultronRetrieverCore-Qwen3.5-4.5B Max result Model with max result In Training Data
Vidore2BioMedicalLecturesRetrieval .665 .670 DataScience-UIBK/Argus-Colqwen3.5-9b-v0-bf16 False
Vidore2ESGReportsHLRetrieval .729 .791 DataScience-UIBK/Argus-Colqwen3.5-9b-v0-bf16 False
Vidore2ESGReportsRetrieval .639 .660 OpenSearch-AI/Ops-Colqwen3-4B False
Vidore2EconomicsReportsRetrieval .612 .658 DataScience-UIBK/Argus-Colqwen3.5-9b-v0 False
Vidore3ComputerScienceRetrieval .798 .809 webAI-Official/webAI-ColVec1-9b False
Vidore3EnergyRetrieval .692 .703 vultr/VultronRetrieverPrime-Qwen3.5-8B False
Vidore3FinanceEnRetrieval .689 .690 vultr/VultronRetrieverPrime-Qwen3.5-8B False
Vidore3FinanceFrRetrieval .520 .545 vultr/VultronRetrieverPrime-Qwen3.5-8B False
Vidore3HrRetrieval .661 .700 webAI-Official/webAI-ColVec1-9b False
Vidore3IndustrialRetrieval .561 .574 vultr/VultronRetrieverPrime-Qwen3.5-8B False
Vidore3PharmaceuticalsRetrieval .674 .682 vultr/VultronRetrieverPrime-Qwen3.5-8B False
Vidore3PhysicsRetrieval .502 .517 vultr/VultronRetrieverPrime-Qwen3.5-8B False
VidoreArxivQARetrieval .920 .938 VAGOsolutions/SauerkrautLM-ColQwen3-8b-v0.1 True
VidoreDocVQARetrieval .673 .687 webAI-Official/webAI-ColVec1-9b True
VidoreInfoVQARetrieval .940 .952 webAI-Official/webAI-ColVec1-9b True
VidoreShiftProjectRetrieval .938 .947 DataScience-UIBK/Argus-Colqwen3.5-4b-v0 False
VidoreSyntheticDocQAAIRetrieval .996 1.000 ApsaraStackMaaS/EvoQwen2.5-VL-Retriever-3B-v1 True
VidoreSyntheticDocQAEnergyRetrieval .983 .980 nvidia/llama-nemotron-colembed-vl-3b-v2 True
VidoreSyntheticDocQAGovernmentReportsRetrieval .985 .989 DataScience-UIBK/Argus-Colqwen3.5-9b-v0 True
VidoreSyntheticDocQAHealthcareIndustryRetrieval 1.000 1.000 vultr/VultronRetrieverPrime-Qwen3.5-8B True
VidoreTabfquadRetrieval .962 .981 nvidia/nemotron-colembed-vl-4b-v2 True
VidoreTatdqaRetrieval .824 .857 DataScience-UIBK/Argus-Colqwen3.5-9b-v0-bf16 True
Average .771 .788 nan -

Model have high performance on these tasks: VidoreSyntheticDocQAEnergyRetrieval


@athrael-soju

Copy link
Copy Markdown
Contributor Author

@Samoed ready to roll

@Samoed Samoed merged commit 3c50ebb into embeddings-benchmark:main Jun 22, 2026
3 of 5 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants