About Me

Victor Besnier

Hey there! I'm Victor — a Research Scientist focused on Computer Vision. Since 2018, when I first explored GANs during an internship, I’ve been passionate about image and video generation. Over time, I’ve worked with diffusion models, flow matching, and recently explored discrete approaches inspired by GPT-style and BERT-style, the latter enabling fast parallel token sampling for image generation. My work has been published at top venues like CVPR, ICCV, and ICLR. Beyond my research, I’m interested in building real-time models for autonomous driving and creating efficient retrieval systems to navigate large datasets of unlabeled images and videos.

Publications

Paper Thumbnail

Halton Scheduler For Masked Generative Image Transformer

ICLR 2025

Authors: V. Besnier, M. Chen, D. Hurych, E. Valle, M. Cord

A novel scheduling strategy for MaskGIT, using with Halton sequences. Improves quality and diversity.

Read on arXiv →
Paper Thumbnail

Don't Drop Your Samples! Coherence-aware training for Conditional Diffusion

CVPR 2024

Authors: N. Dufour, V. Besnier, V. Kalogeiton, D. Picard

We introduce coherence-aware training of conditional diffusion models, reducing the need to filter dataset.

Read on arXiv →
Paper Thumbnail

Triggering Failures: OOD Detection in Semantic Segmentation

ICCV 2021

Authors: V. Besnier, A. Bursuc, D. Picard, A. Briot

Explores learning from local adversarial attacks for more robust out-of-distribution detection in segmentation models.

Read on arXiv →
Paper Thumbnail

VaViM and VaVAM: Autonomous Driving through Video Generative Modeling

ArXiv 2025

Authors: F. Bartoccioni, E. Ramzi, V. Besnier, S. Venkataramanan et al.

We explore the potential of large-scale generative video models for autonomous driving, introducing an open-source auto-regressive video model (VaViM) and its companion video-action model (VaVAM).

Read on arXiv →

📰 News

Feb 2025

Our paper on the Halton Scheduler for MaskGIT is accepted at ICLR 2025! Code & Paper.

Jan 2025

I became a Valeo Expert — 6 exciting years so far! 🎉

Jan 2025

Released VaVim, a large-scale model for autonomous driving. Code & Paper.

Jun 2024

Presented 'Don't drop your sample!' and 'SegAD' at CVPR 2024 in Seattle. 🚀

Feb 2023

Released a MaskGIT training tech report and code. Code & Paper.

Feb 2023

Started at Valeo.ai Prague as a Research Scientist in video & image synthesis.

Nov 2022

🎓 Got my PhD in ML for autonomous systems!

Positions

2023–Present: Research Scientist, Valeo.ai

Focus: Image and video synthesis with Diffusion Models, MaskGIT, Auto-regressive models, and GANs.
Collaborators: Patrick Pérez, Matthieu Cord
Location: Prague, Czech Republic

2019–2022: PhD Candidate, Valeo

Thesis: Safety of automotive systems through Machine Learning
Advisors: David Picard, Alexandre Briot
Location: Créteil, France

2019: Research Intern, Valeo.ai

Topic: Generative Adversarial Networks and latent space optimization for image classification
Advisors: Matthieu Cord, Himalaya Jain
Location: Paris, France

2018: Research Intern, LIP6

Topic: Unsupervised super-resolution using Generative Adversarial Networks
Advisor: Ludovic Denoyer
Location: Paris, France

Education

2019–2022: PhD, École des Ponts ParisTech (ENPC)

Thesis: Safety of autonomous systems using Machine Learning
Location: Champs-sur-Marne, France

2017–2019: Master's Degree, Sorbonne Université (Paris VI)

Program: DAC – Data Science and Machine Learning
Distinction: Graduated with honors
Location: Paris, France

2014–2017: Bachelor's Degree, UPMC (Paris VI)

Major: Computer Science and Mathematics
Distinction: Graduated with honors
Location: Paris, France

Contact

Feel free to reach out or connect with me: