1ShanghaiTech University
2MoE Key Laboratory of Intelligent Perception and Human-Machine Collaboration
3Fudan University
[arXiv] [Project page]
We reveal that the diffusion bridge with Doob's h-transform is a special case of our stochastic optimal control (SOC) formulation, recovered when the terminal penalty coefficient in the SOC cost tends to infinity.
Install the dependencies with Anaconda and activate the environment:
conda create --name UniDB python=3.9
conda activate UniDB
pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu121
pip install -r requirements.txt
- Prepare datasets.
- Download pretrained checkpoints here
- Modify options, including dataroot_GT, dataroot_LQ and pretrain_model_G.
- Choose a model to sample (default: UniDB) via the test function in codes/models/denoising_model.py, then run:
python test.py -opt=options/test.yml
The test results will be saved in \results.
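Before running test.py, it can help to verify that the paths configured in options/test.yml actually exist. The sketch below is a hypothetical helper, not part of the repository: it assumes PyYAML is available and simply searches the option file for the dataroot_GT, dataroot_LQ and pretrain_model_G keys (listed under Important Option Details below), whatever nesting they sit under.

```python
# check_option_paths.py -- hypothetical helper, not part of the UniDB repository.
# Recursively looks up a few path-like keys in options/test.yml and reports
# whether each configured path exists on disk. Assumes PyYAML is installed.
import os
import yaml

KEYS = {"dataroot_GT", "dataroot_LQ", "pretrain_model_G"}

def find_paths(node, found=None):
    """Collect (key, value) pairs for the keys above from a nested YAML dict/list."""
    if found is None:
        found = []
    if isinstance(node, dict):
        for key, value in node.items():
            if key in KEYS and isinstance(value, str):
                found.append((key, value))
            find_paths(value, found)
    elif isinstance(node, list):
        for item in node:
            find_paths(item, found)
    return found

with open("options/test.yml", "r") as f:
    opt = yaml.safe_load(f)

for key, path in find_paths(opt):
    status = "ok" if os.path.exists(path) else "MISSING"
    print(f"{key}: {path} [{status}]")
```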
For reference, we computed the average distance between the high-quality and low-quality images of the three datasets used in the experiments below (CelebA-HQ, Rain100H, and DIV2K).
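As an illustration of how such an average distance can be computed, here is a minimal sketch; it is our assumption rather than the exact script used for the paper. It pairs HQ and LQ images by filename and averages a per-image L2 distance in pixel space; the folder paths are placeholders.

```python
# avg_distance.py -- illustrative sketch only; the metric (mean per-image L2
# distance in pixel space) and the folder layout are assumptions.
import os
import numpy as np
from PIL import Image

def average_distance(hq_dir, lq_dir):
    """Average L2 distance between HQ/LQ image pairs matched by filename."""
    dists = []
    for name in sorted(os.listdir(hq_dir)):
        hq = np.asarray(Image.open(os.path.join(hq_dir, name)).convert("RGB"), dtype=np.float64)
        lq_img = Image.open(os.path.join(lq_dir, name)).convert("RGB")
        if lq_img.size != (hq.shape[1], hq.shape[0]):  # e.g. super-resolution pairs
            lq_img = lq_img.resize((hq.shape[1], hq.shape[0]))
        lq = np.asarray(lq_img, dtype=np.float64)
        dists.append(np.linalg.norm(hq - lq) / hq.size)  # size-normalized L2
    return float(np.mean(dists))

if __name__ == "__main__":
    # placeholder dataset paths
    print(average_distance("datasets/CelebA-HQ/GT", "datasets/CelebA-HQ/LQ"))
```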
- Prepare datasets.
- Modify options, including dataroot_GT, dataroot_LQ.
python train.py -opt=options/train.yml for a single GPU.
torchrun --nproc_per_node=2 --master_port=1111 train.py -opt=options/train.yml --launcher pytorch for multiple GPUs. Attention: see Important Option Details.
- For the DIV2K dataset, your GPU memory needs to be greater than 34 GB.
- You can modify the parameter gamma in UniDB-GOU/utils/sde_utils.py to balance the control term and the terminal penalty term in the stochastic optimal control objective, which can yield better image quality (see the schematic objective after this section).
Here, we mainly focus on modifying the GOU (Generalized Ornstein-Uhlenbeck) process. For modifications related to VE and VP, readers can refer to the derivations in the appendix of our paper and make the changes themselves (which only require modifying one or two lines of code). We will also release the next version as soon as possible.
The training log will be saved in \experiments.
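For intuition on the role of gamma, the stochastic optimal control problem behind UniDB can be written schematically as below (subject to the controlled forward SDE). This is a simplified sketch, not the paper's exact statement; the precise formulation and the closed-form solution are in the paper and its appendix. The first term is the control cost and gamma weights the terminal penalty; letting gamma tend to infinity enforces the terminal constraint exactly and recovers the Doob's h-transform bridge.

```latex
% Schematic SOC objective (simplified; see the paper for the exact formulation):
\min_{u}\;\mathbb{E}\!\left[\int_{0}^{T}\tfrac{1}{2}\,\lVert u_{t}\rVert^{2}\,\mathrm{d}t
\;+\;\tfrac{\gamma}{2}\,\lVert x_{T}-x_{\mathrm{target}}\rVert^{2}\right],
\qquad \gamma\to\infty \;\Rightarrow\; \text{Doob's } h\text{-transform bridge.}
```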
We provide interface.py for deraining, which generates the HQ image from an LQ input alone:
- Prepare options/test.yml and fill in the LQ path.
- Run python interface.py. The interface will be served on the local machine at 127.0.0.1.
Other tasks can be supported by imitating this script; a rough sketch follows.
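Since the demo is served on a local address, a web-UI library such as Gradio is a natural fit. The sketch below is a hypothetical stand-in for adapting the idea to another task, not the repository's interface.py; the restore function, its checkpoint loading, and the option path are placeholders you would replace with the actual UniDB model calls.

```python
# my_interface.py -- hypothetical sketch of a local image-restoration demo.
# It assumes the Gradio library; `restore` is a placeholder where the UniDB
# model (configured via options/test.yml) would run its reverse sampling.
import gradio as gr
import numpy as np

def restore(lq_image: np.ndarray) -> np.ndarray:
    """Placeholder: replace with UniDB inference on the low-quality input."""
    # e.g. preprocess -> model sampling -> postprocess
    return lq_image  # identity, just to keep the demo runnable

demo = gr.Interface(
    fn=restore,
    inputs=gr.Image(type="numpy", label="Low-quality input"),
    outputs=gr.Image(type="numpy", label="Restored output"),
    title="UniDB demo (sketch)",
)

if __name__ == "__main__":
    demo.launch(server_name="127.0.0.1")  # served on the local machine
```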
Important Option Details:
- dataroot_GT: Ground-Truth (high-quality) data path.
- dataroot_LQ: Low-Quality data path.
- pretrain_model_G: Pretrained model path.
- GT_size, LQ_size: Size of the data cropped during training.
- niter: Total training iterations.
- val_freq: Frequency of validation during training.
- save_checkpoint_freq: Frequency of saving checkpoints during training.
- gpu_ids: In multi-GPU training, GPU ids are separated by commas.
- batch_size: In multi-GPU training, must satisfy batch_size / num_gpu > 1.
We provide a brief guideline for computing the FID between two sets of images:
- Install the FID library: pip install pytorch-fid
- Compute FID:
python -m pytorch_fid GT_images_file_path generated_images_file_path --batch-size 1
If all the images are the same size, you can remove --batch-size 1 to accelerate the computation.
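The same library can also be called from Python. The sketch below assumes pytorch-fid's calculate_fid_given_paths helper; check your installed version for the exact signature, and replace the placeholder folder paths.

```python
# compute_fid.py -- sketch using the pytorch-fid Python API; the function name and
# arguments follow recent pytorch-fid releases and may differ in other versions.
import torch
from pytorch_fid.fid_score import calculate_fid_given_paths

device = "cuda" if torch.cuda.is_available() else "cpu"
fid = calculate_fid_given_paths(
    ["GT_images_file_path", "generated_images_file_path"],  # the two image folders
    batch_size=1,      # keep at 1 if the images have different sizes
    device=device,
    dims=2048,         # default InceptionV3 feature dimension
)
print(f"FID: {fid:.3f}")
```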
If you find this repository useful in your research, please consider citing:
@inproceedings{
zhu2025unidb,
title={Uni{DB}: A Unified Diffusion Bridge Framework via Stochastic Optimal Control},
author={Kaizhen Zhu and Mokai Pan and Yuexin Ma and Yanwei Fu and Jingyi Yu and Jingya Wang and Ye Shi},
booktitle={Forty-second International Conference on Machine Learning},
year={2025},
url={https://openreview.net/forum?id=uqCfoVXb67}
}



