We propose RoboVIP, a multi-view, inpainting-based video diffusion model conditioned on identity references, which augments robot manipulation data in both simulation and real-world robot setups.
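To make the conditioning interface concrete, below is a minimal, hypothetical sketch in plain PyTorch (not the RoboVIP release code) of how inpainting-style conditioning for a multi-view clip is commonly assembled: the masked video and its masks are concatenated along the channel axis, while an identity reference image is kept as a separate conditioning input. All tensor names, shapes, and the resolution here are illustrative assumptions.

```python
import torch

# Illustrative shapes (assumptions, not RoboVIP's actual configuration):
# V camera views, T frames, 3-channel RGB at 256x256 resolution.
V, T, C, H, W = 2, 16, 3, 256, 256

video = torch.rand(V, T, C, H, W)                  # multi-view manipulation clip
mask = (torch.rand(V, T, 1, H, W) > 0.5).float()   # 1 = region to be regenerated
identity_ref = torch.rand(1, C, H, W)              # reference image providing the visual identity

# Common inpainting-style conditioning: hide the masked region, then concatenate
# the masked video with the mask along the channel axis so a denoiser sees both
# the preserved context and where new content should be generated.
masked_video = video * (1.0 - mask)
inpaint_cond = torch.cat([masked_video, mask], dim=2)  # (V, T, C + 1, H, W)

print(inpaint_cond.shape)   # torch.Size([2, 16, 4, 256, 256])
print(identity_ref.shape)   # passed separately, e.g. through an image encoder
```

Channel-wise concatenation of masked frames and masks is the standard recipe in inpainting diffusion models; see the paper for how RoboVIP actually injects the multi-view and identity-reference conditions.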
Update | Installation | Inference Augmentation | Dataset Preprocessing | Train
- Release the paper
- Release the Video Diffusion Model weights and Inference Code
- Release a less GPU-memory-intensive (<80 GB) version of Bridge RLDS
- Release the dataset preprocessing code
- Release the training code for the Video Diffusion Model
- Release the simulation testing code
- Release the training code for simulation
If you like RoboVIP, please give this repo a star. Thanks!
Under Review. Code Will Be Released Soon.
If you make use of our work, please cite our paper.
@article{wang2026robovip,
  title={RoboVIP: Multi-View Video Generation with Visual Identity Prompting Augments Robot Manipulation},
  author={Wang, Boyang and Zhang, Haoran and Zhang, Shujie and Hao, Jinkun and Jia, Mingda and Lv, Qi and Mao, Yucheng and Lyu, Zhaoyang and Zeng, Jia and Xu, Xudong and others},
  journal={arXiv preprint arXiv:2601.05241},
  year={2026}
}

RoboVIP is built on diffusers and RoboEngine. We thank the authors for sharing their excellent codebases.

