Skip to content

Add vultr/VultronRetrieverFlash-Qwen3.5-0.8B#4845

Merged
Samoed merged 1 commit into
embeddings-benchmark:mainfrom
athrael-soju:add-vultron-flash-0-8b
Jun 21, 2026
Merged

Add vultr/VultronRetrieverFlash-Qwen3.5-0.8B#4845
Samoed merged 1 commit into
embeddings-benchmark:mainfrom
athrael-soju:add-vultron-flash-0-8b

Conversation

@athrael-soju

Copy link
Copy Markdown
Contributor

Adds a ModelMeta for vultr/VultronRetrieverFlash-Qwen3.5-0.8B, the 0.8B small-tier sibling of the already-registered vultr/VultronRetrieverPrime-Qwen3.5-8B.

  • Late-interaction (ColBERT-style, MaxSim) visual document retriever, ColQwen3_5 architecture, 320-dim multi-vector, image+text.
  • Reuses the existing ColQwen3_5Wrapper verbatim (same loader as colqwen3_5_v3 and the 8B Prime entry); embed_dim=320 is set via ModelMeta, no wrapper change.
  • Apache-2.0, 6 languages (en/fr/de/es/it/pt), base Qwen/Qwen3.5-0.8B.
  • training_datasets is the same pool as the 8B Prime entry (VULTRON_PRIME_8B_TRAINING_DATA).

Official MTEB ViDoRe (late-interaction evaluator, dim 320 / 1792 visual tokens): V1 0.8815 (ndcg@5) / V2 0.6036 (ndcg@5) / V3 0.5649 (ndcg@10).

The matching results PR (V1/V2/V3 task JSONs) to embeddings-benchmark/results follows once this merges, so the model name resolves via get_model_meta.

Signed-off-by: Athrael Soju <athrael.soju@gmail.com>
@athrael-soju

Copy link
Copy Markdown
Contributor Author

@Samoed ❤️

@Samoed Samoed merged commit fe5f0d0 into embeddings-benchmark:main Jun 21, 2026
13 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants