Contamination Detection for VLMs using Multi-Modal Semantic Perturbation (ICLR 2026)

Novel contamination detection methodology for VLMs that is practical, reliable, and consistent: arXiv Link.

📣 News

(Jan 25th, 2026): Our paper has been accepted at ICLR 2026.

📕 Overview of Multi-Modal Semantic Perturbation

Our pipeline, multi-modal semantic perturbation, generates image-question pairs that keep the original image composition intact but are modified slightly so that the answer changes.

The perturbed benchmark has similar or lower difficulty than the original benchmark, so clean models that truly generalize should perform comparably or better on it. However, we find that contaminated models consistently underperform, with dramatic performance drops of up to 45%.
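The comparison described above can be sketched as a small helper that scores a model on both benchmarks and reports the change. This is a minimal illustration, not the paper's evaluation code: the function names, the use of absolute percentage points, and the `threshold` parameter are all assumptions made for this example.

```python
def accuracy(predictions, answers):
    """Fraction of predictions that exactly match the reference answers."""
    if len(predictions) != len(answers):
        raise ValueError("predictions and answers must have the same length")
    if not answers:
        raise ValueError("cannot compute accuracy on an empty benchmark")
    correct = sum(p == a for p, a in zip(predictions, answers))
    return correct / len(answers)


def perturbation_delta(orig_acc, pert_acc):
    """Accuracy change from original to perturbed benchmark, in percentage points.

    Assumption: the change is measured as an absolute difference.
    A negative value means the model did worse on the perturbed set.
    """
    return 100.0 * (pert_acc - orig_acc)


def looks_contaminated(orig_acc, pert_acc, threshold=0.0):
    """Hypothetical decision rule: flag a model whose accuracy drops by more
    than `threshold` percentage points on the perturbed benchmark."""
    return perturbation_delta(orig_acc, pert_acc) < -threshold
```

A model that improves or holds steady on the perturbed set is not flagged; a model whose accuracy falls is. In practice the threshold would need to account for run-to-run noise.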

Pipeline of Multi-modal Semantic Perturbation

(Step 1) Randomly sample a new answer from the original question.
(Step 2) Generate dense captions of the original image, conditioned on the question and the new answer.
(Step 3) Provide the description as the prompt to Flux+ControlNet and generate the perturbed images.
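Step 1 can be sketched as follows for a multiple-choice question. This is a hedged illustration: it assumes the new answer is drawn uniformly from the question's other answer options, and the function name `sample_new_answer` and the letter-to-text `options` mapping are hypothetical rather than taken from the repository.

```python
import random


def sample_new_answer(options, original_answer, rng=None):
    """Pick a replacement answer letter that differs from the original one.

    options: dict mapping option letter (e.g. "A") to option text.
    original_answer: the letter of the benchmark's ground-truth answer.
    rng: optional random.Random instance, useful for reproducible sampling.
    """
    rng = rng or random.Random()
    # Keep only non-empty options other than the original answer.
    candidates = [
        letter
        for letter, text in sorted(options.items())
        if letter != original_answer and text
    ]
    if not candidates:
        raise ValueError("no alternative answer available for this question")
    return rng.choice(candidates)
```

Passing a seeded `random.Random` makes the perturbed benchmark reproducible across runs. The sampled answer then conditions the caption in Step 2.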

🔨 Setup

To contaminate LLaVA-v1.5 and Qwen2-VL-7B, we follow the official repository and LLaMA-Factory, respectively, and fine-tune each model on the data we want to contaminate it with.
To evaluate the contaminated and clean models, we use VLMEvalKit. We provide the .tsv files that can be used to evaluate models on VLMEvalKit.
- Update config.py in the original repo with your contaminated models, e.g., in VLMEvalKit/config.py.
- Update vlmeval/dataset/image_base.py, image_caption.py, image_mcq.py with the .tsv path accordingly.
The system prompts can be found in prompts.py. This process can also be run with a lightweight open-source model, as shown in the paper.
For Flux+ControlNet, we follow the default settings from this repository. Replace the main.py with flux/main.py.
Optionally, one can use a strong reasoning model, such as o3, to bypass manual filtering. Refer to prompts.py.

💾 Perturbed Benchmarks

We release the .tsv files that can be used to evaluate models on perturbed RealWorldQA and MMStar in ./tsv. The perturbed images can be downloaded from release v1.0.0.
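Before pointing an evaluation run at a .tsv file, it can help to sanity-check that every row's answer refers to an option that exists. The sketch below uses only the standard library. The column names (`index`, `question`, `A` to `D`, `answer`) are an assumption based on the layout commonly used for multiple-choice benchmarks in VLMEvalKit and may not match the released files exactly; `check_mcq_tsv` is a hypothetical helper, not part of this repository.

```python
import csv

# Assumed option columns; adjust if the file uses more or fewer choices.
OPTION_COLUMNS = ("A", "B", "C", "D")


def check_mcq_tsv(handle):
    """Return a list of (row_index, problem) for rows that look malformed.

    handle: an open text file (or any iterable of lines) in TSV format
    with a header row.
    """
    problems = []
    reader = csv.DictReader(handle, delimiter="\t")
    for row in reader:
        idx = row.get("index", "?")
        answer = (row.get("answer") or "").strip()
        # Options that are actually filled in for this row.
        available = [c for c in OPTION_COLUMNS if (row.get(c) or "").strip()]
        if not (row.get("question") or "").strip():
            problems.append((idx, "missing question"))
        if answer not in available:
            problems.append((idx, f"answer {answer!r} not among options {available}"))
    return problems
```

To use it on a file, open the file with `newline=""` and pass the handle to `check_mcq_tsv`. If the file embeds images as long base64 strings, `csv.field_size_limit` may need to be raised first.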

📌 Citation

@article{park2025vlmcont,
    title={Contamination Detection for VLMs using Multi-Modal Semantic Perturbation}, 
    author={Jaden Park and Mu Cai and Feng Yao and Jingbo Shang and Soochahn Lee and Yong Jae Lee},
    journal={International Conference on Learning Representations},
    year={2026},  
}
