PhaSR: Generalized Image Shadow Removal with Physically Aligned Priors

¹National Yang Ming Chiao Tung University, ²National Cheng Kung University

Abstract

Shadow removal under diverse lighting requires disentangling illumination from intrinsic reflectance, a challenge when physical priors are misaligned.

We propose PhaSR with dual-level prior alignment: (1) PAN performs parameter-free illumination correction via Gray-world normalization and log-domain Retinex decomposition, suppressing chromatic bias. (2) GSRA extends differential attention to harmonize depth-derived geometry with DINO-v2 semantics, resolving modal conflicts across illumination conditions.
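The two parameter-free operations named for PAN can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the 1D box-filter illumination estimate, and the `eps` constant are all assumptions chosen for illustration, and plain Python lists are used to keep the example dependency-free. Gray-world normalization rescales each color channel so that its mean matches the overall gray mean; the log-domain Retinex step subtracts a log-illumination estimate from the log-intensity to leave a reflectance-like term.

```python
import math


def gray_world(pixels):
    """Scale each RGB channel so its mean equals the mean of the channel means."""
    n = len(pixels)
    means = [sum(p[c] for p in pixels) / n for c in range(3)]
    gray = sum(means) / 3.0
    gains = [gray / max(m, 1e-6) for m in means]  # guard against empty channels
    return [tuple(p[c] * gains[c] for c in range(3)) for p in pixels]


def box_mean(signal, radius=1):
    """Hypothetical illumination estimate: local mean over a 1D window."""
    out = []
    for i in range(len(signal)):
        window = signal[max(0, i - radius): i + radius + 1]
        out.append(sum(window) / len(window))
    return out


def log_retinex(intensity, illumination, eps=1e-6):
    """Log-domain Retinex: log R = log I - log L, elementwise."""
    return [math.log(i + eps) - math.log(l + eps)
            for i, l in zip(intensity, illumination)]


# A reddish two-pixel "image": after balancing, all channel means coincide.
balanced = gray_world([(0.8, 0.4, 0.2), (0.4, 0.2, 0.1)])

# A row whose right half is shadowed; the log-reflectance is near zero
# away from the shadow edge, where the local mean tracks the intensity.
row = [0.6, 0.6, 0.6, 0.2, 0.2, 0.2]
log_reflectance = log_retinex(row, box_mean(row))
```

Working in the log domain turns the multiplicative model I = R · L into a subtraction, which is why no learned parameters are needed for this step.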

Experiments demonstrate competitive performance with lower complexity, generalizing to ambient lighting where traditional methods fail.

Network Architecture

A multi-scale Transformer encoder-decoder integrates frozen DINO-v2 semantic features and DepthAnything-v2 geometric priors via GSRA's cross-modal differential attention (A_rect = A_sem - λ·A_geo).
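The rectification A_rect = A_sem - λ·A_geo can be sketched as follows. This is only an illustration under stated assumptions: each attention map is taken to be a row-wise softmax of scaled dot products, λ is a fixed scalar, and the query/key inputs are hypothetical stand-ins for projected DINO-v2 and depth features. How PhaSR actually forms these projections, and whether λ is learned, is not specified here.

```python
import math


def softmax(row):
    """Numerically stable softmax over one row of scores."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    s = sum(exps)
    return [e / s for e in exps]


def attention_map(q, k):
    """Row-wise softmax of Q K^T / sqrt(d) for lists of token vectors."""
    d = len(q[0])
    return [
        softmax([sum(a * b for a, b in zip(qi, kj)) / math.sqrt(d) for kj in k])
        for qi in q
    ]


def rectified_attention(q_sem, k_sem, q_geo, k_geo, v, lam=0.5):
    """Compute A_rect = A_sem - lam * A_geo and apply it to the values v."""
    a_sem = attention_map(q_sem, k_sem)
    a_geo = attention_map(q_geo, k_geo)
    a_rect = [[s - lam * g for s, g in zip(rs, rg)]
              for rs, rg in zip(a_sem, a_geo)]
    out = [[sum(w * vj[c] for w, vj in zip(row, v)) for c in range(len(v[0]))]
           for row in a_rect]
    return a_rect, out


# Two tokens with 2-D features (all values are made up for illustration).
q_sem = k_sem = [[1.0, 0.0], [0.0, 1.0]]
q_geo = k_geo = [[0.5, 0.5], [0.5, 0.5]]
values = [[1.0, 2.0], [3.0, 4.0]]
a_rect, out = rectified_attention(q_sem, k_sem, q_geo, k_geo, values, lam=0.5)
```

Because each softmax row sums to 1, every row of `a_rect` sums to 1 - λ, and individual weights can be negative; the subtraction cancels attention mass that both modalities assign to the same positions.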

Comparison Results on WSRD+ (Realworld-Indoor)

Sample 1

[Image comparisons, input vs. output, for each method: PhaSR (Ours), DenseSR, ShadowRefiner, StableShadowDiffusion]

Sample 2

[Image comparisons, input vs. output, for each method: PhaSR (Ours), DenseSR, ShadowRefiner, StableShadowDiffusion]

Comparison Results on INS Dataset (Synthesized-Indoor)

Sample 3 - Full Image

[Image comparisons, input vs. output, for each method: PhaSR (Ours), DenseSR, OmniSR, StableShadowDiffusion]

Sample 3 - Cropped Region

[Image comparisons, input vs. output, for each method: PhaSR (Ours), DenseSR, OmniSR, StableShadowDiffusion]

Sample 4 - Full Image

[Image comparisons, input vs. output, for each method: PhaSR (Ours), DenseSR, OmniSR, StableShadowDiffusion]

Sample 4 - Cropped Region

[Image comparisons, input vs. output, for each method: PhaSR (Ours), DenseSR, OmniSR, StableShadowDiffusion]

BibTeX

@misc{lee2024phasr,
  title={PhaSR: Generalized Image Shadow Removal with Physically Aligned Priors},
  author={Lee, Chia-Ming and Lin, Yu-Fan and Hsiao, Yu-Jou and Jiang, Jin-Hui and Liu, Yu-Lun and Hsu, Chih-Chung},
  year={2026},
  eprint={2601.17470},
  archivePrefix={arXiv},
  primaryClass={cs.CV},
  url={https://arxiv.org/abs/2601.17470}
}

CVPR 2026

Motivation: Existing methods lose physical priors through encoder-decoder bottlenecks, failing to localize shadows accurately (see feature visualizations above). PhaSR addresses this via dual-level physically aligned priors, generalizing from single-light to multi-source ambient lighting scenarios.
