RealWonder: Real-Time Physical Action-Conditioned Video Generation

About

Current video generation models cannot simulate physical consequences of 3D actions like forces and robotic manipulations, as they lack structural understanding of how actions affect 3D scenes. We present RealWonder, the first real-time system for action-conditioned video generation from a single image. Our key insight is using physics simulation as an intermediate bridge: instead of directly encoding continuous actions, we translate them through physics simulation into visual representations (optical flow and RGB) that video models can process. RealWonder integrates three components: 3D reconstruction from single images, physics simulation, and a distilled video generator requiring only 4 diffusion steps. Our system achieves 13.2 FPS at 480×832 resolution, enabling interactive exploration of forces, robot actions, and camera controls on rigid objects, deformable bodies, fluids, and granular materials.

RealWonder: Real-Time Physical Action-Conditioned Video Generation
Project Page | Paper
Wei Liu*, Ziyu Chen*, Zizhang Li, Yue Wang, Hong-Xing (Koven) Yu†, Jiajun Wu†
Stanford University, University of Southern California
*Equal contribution †Equal advising

Installation

1. Create Environment

conda env create -f default.yml
conda activate realwonder

2. Install SAM 3D Objects

cd submodules/sam_3d_objects
export PIP_EXTRA_INDEX_URL="https://pypi.ngc.nvidia.com https://download.pytorch.org/whl/cu121"
pip install -e '.[dev]'
pip install -e '.[p3d]'
export PIP_FIND_LINKS="https://nvidia-kaolin.s3.us-east-2.amazonaws.com/torch-2.5.1_cu121.html"
pip install -e '.[inference]'
./patching/hydra
cd ../..

Checkpoints

pip install 'huggingface-hub[cli]<1.0'
TAG=hf
hf download --repo-type model --local-dir checkpoints/${TAG}-download --max-workers 1 facebook/sam-3d-objects
mv checkpoints/${TAG}-download/checkpoints checkpoints/${TAG}
rm -rf checkpoints/${TAG}-download

3. Install SAM 2

cd submodules/sam2
pip install -e .
cd checkpoints && ./download_ckpts.sh && cd ..
cd ../..

4. Install Genesis

cd submodules/Genesis
git checkout 3aa206cd84729bc7cc14fb4007aeb95a0bead7aa
pip install -e .
cd ../..

5. Install Other Dependencies

pip install -r requirements.txt

6. Download Model Checkpoints

hf download ziyc/realwonder --include "Realwonder-Distilled-AR-I2V-Flow/*" --local-dir ckpts/
hf download alibaba-pai/Wan2.1-Fun-V1.1-1.3B-InP --local-dir wan_models/Wan2.1-Fun-V1.1-1.3B-InP

Usage

Interactive Demo (Real-Time UI)

Tested on NVIDIA H200 GPU with CUDA 12.1.

Installation

pip install -r demo_web/requirements.txt

How to run

cd demo_web
python app.py \
    --demo_data demo_data/lamp \
    --checkpoint_path /path/to/checkpoint.pt

Offline Inference

Run physics simulation:

python case_simulation.py --config_path demo_data/lamp/config.yaml

Run video generation from simulation results:

python infer_sim.py \
    --checkpoint_path ckpts/Realwonder-Distilled-AR-I2V-Flow/sink_size=1-attn_size=21-frame_per_block=3-denoising_steps=4/step=000800.pt \
    --sim_data_path result/lamp/final_sim \
    --output_path result/lamp/final_sim/final.mp4

Citation

@misc{realwonder2026,
  title={RealWonder: Real-Time Physical Action-Conditioned Video Generation},
  author={Liu, Wei and Chen, Ziyu and Li, Zizhang and Wang, Yue and Yu, Hong-Xing and Wu, Jiajun},
  year={2026},
  eprint={2603.05449},
  archivePrefix={arXiv},
  primaryClass={cs.CV},
  url={https://arxiv.org/abs/2603.05449},
}

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
assets		assets
cases		cases
demo_web		demo_web
simulation		simulation
vidgen		vidgen
wan		wan
.gitignore		.gitignore
.gitmodules		.gitmodules
LICENSE.md		LICENSE.md
README.md		README.md
case_simulation.py		case_simulation.py
default.yml		default.yml
infer_sim.py		infer_sim.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RealWonder: Real-Time Physical Action-Conditioned Video Generation

About

Installation

1. Create Environment

2. Install SAM 3D Objects

Checkpoints

3. Install SAM 2

4. Install Genesis

5. Install Other Dependencies

6. Download Model Checkpoints

Usage

Interactive Demo (Real-Time UI)

Installation

How to run

Offline Inference

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

RealWonder: Real-Time Physical Action-Conditioned Video Generation

About

Installation

1. Create Environment

2. Install SAM 3D Objects

Checkpoints

3. Install SAM 2

4. Install Genesis

5. Install Other Dependencies

6. Download Model Checkpoints

Usage

Interactive Demo (Real-Time UI)

Installation

How to run

Offline Inference

Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages