WaterFlow: Explicit Physics-Prior Rectified Flow for Underwater Saliency Mask Generation

Runting Li, Shijie Lian, Hua Li, Yutong Li, Wenhui Wu, Sam Kwong

This repository is the official implementation of "WaterFlow: Explicit Physics-Prior Rectified Flow for Underwater Saliency Mask Generation" (ICASSP 2026).

Overview

Underwater Salient Object Detection (USOD) faces significant challenges including image quality degradation and domain gaps caused by light absorption and scattering in water. Existing methods tend to ignore the physical principles of underwater imaging, or treat degradation phenomena purely as interference to be removed, failing to exploit the valuable information they contain.

We propose WaterFlow, a rectified flow-based framework that incorporates underwater physical imaging information as explicit priors directly into the network training process. Specifically, we adopt the SeaThru image formation model to decompose each underwater image into backscatter and direct transmission components, and inject these physics-derived features into a PVT backbone at multiple scales during training. At inference time, the model inherits the physically grounded representations learned during training without requiring depth input.

Architecture

Key Contributions

Explicit Physical Prior Integration: WaterFlow incorporates underwater physical imaging information (based on the SeaThru model) as explicit prior knowledge directly into the training process, guiding the backbone to learn physically grounded feature representations.
Rectified Flow for USOD: We are the first to systematically apply Rectified Flow to underwater salient object detection, replacing the DDPM framework in CamoDiffusion with a more efficient straight-trajectory formulation.
Temporal Dimension Modeling: We introduce time ensemble strategies that leverage intermediate denoising steps to improve prediction robustness.
State-of-the-Art Performance: WaterFlow achieves substantial improvements on USOD10K and UFO-120 benchmarks.

Requirements

Python == 3.10
CUDA == 12.6
torch == 2.7.1

pip install -r requirements.txt

Note: mmcv-full==1.7.2 requires a separate installation step:
pip install -U openmim
mim install mmcv-full==1.7.2

Datasets

Download the following underwater SOD datasets and organize them as follows:

media/dataset/
        ├── USOD10K/
        │   ├── TrainDataset/
        │   │   ├── Imgs/
        │   │   ├── GT/
        │   │   └── DepRaw/
        │   └── TestDataset/
        │       ├── Imgs/
        │       └── GT/
        ├── UFO120/
        │   └── TestDataset/
        │       ├── Imgs/
        │       └── GT/
        ├── SUIM/
        └── USOD/

Training

Train at 352×352:

accelerate launch train.py \
  --config config/waterflow_352x352.yaml \
  --num_epoch=150 \
  --batch_size=8 \
  --gradient_accumulate_every=4

Fine-tune at 384×384:

accelerate launch train.py \
  --config config/waterflow_384x384.yaml \
  --num_epoch=20 \
  --batch_size=8 \
  --gradient_accumulate_every=4 \
  --pretrained results/${RESULT_DIRECTORY}/model-best.pt \
  --lr_min=0 \
  --set optimizer.params.lr=1e-5

Evaluation

accelerate launch sample.py \
  --config config/waterflow_384x384.yaml \
  --results_folder ${RESULT_SAVE_PATH} \
  --checkpoint ${CHECKPOINT_PATH} \
  --num_sample_steps 1 \
  --target_dataset USOD10K

Acknowledgements

This project is built on top of CamoDiffusion. The underwater physics model is based on Sea-Thru (Akkaynak & Treibitz, CVPR 2019). The backbone follows PVTv2. The diffusion framework is based on DDPM (Ho et al., NeurIPS 2020) and Rectified Flow (Liu et al., ICLR 2023), with implementations from denoising-diffusion-pytorch and rectified-flow-pytorch by lucidrains.

Citation

If you find this work useful, please cite:

@inproceedings{li2026waterflow,
  title     = {WaterFlow: Explicit Physics-Prior Rectified Flow for Underwater Saliency Mask Generation},
  author    = {Li, Runting and Lian, Shijie and Li, Hua and Li, Yutong and Wu, Wenhui and Kwong, Sam},
  booktitle = {ICASSP},
  year      = {2026}
}

This work extends CamoDiffusion. If you use this repository, please also consider citing:

@article{chen2023camodiffusion,
  title   = {CamoDiffusion: Camouflaged Object Detection via Conditional Diffusion Models},
  author  = {Chen, Zhongxi and Sun, Ke and Lin, Xianming and Ji, Rongrong},
  journal = {arXiv preprint arXiv:2305.17932},
  year    = {2023}
}

@article{sun2025conditional,
  title     = {Conditional Diffusion Models for Camouflaged and Salient Object Detection},
  author    = {Sun, Ke and Chen, Zhongxi and Lin, Xianming and Sun, Xiaoshuai and Liu, Hong and Ji, Rongrong},
  journal   = {IEEE Transactions on Pattern Analysis and Machine Intelligence},
  year      = {2025},
  publisher = {IEEE}
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

WaterFlow: Explicit Physics-Prior Rectified Flow for Underwater Saliency Mask Generation

Overview

Architecture

Key Contributions

Requirements

Datasets

Training

Evaluation

Acknowledgements

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
assets		assets
config		config
dataset		dataset
denoising-diffusion-pytorch		denoising-diffusion-pytorch
rectified-flow-pytorch		rectified-flow-pytorch
scripts		scripts
utils		utils
waterflow		waterflow
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt
sample.py		sample.py
train.py		train.py

Folders and files

Latest commit

History

Repository files navigation

WaterFlow: Explicit Physics-Prior Rectified Flow for Underwater Saliency Mask Generation

Overview

Architecture

Key Contributions

Requirements

Datasets

Training

Evaluation

Acknowledgements

Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages