PyTorch codes for "Multi-Axis Feature Diversity Enhancement for Remote Sensing Video Super-Resolution", IEEE Transactions on Image Processing (IEEE TIP), 2025.
- Authors: Yi Xiao, Qiangqiang Yuan*, Kui Jiang*, Yuzeng Chen, Shiqi Wang, and Chia-Wen Lin
- Wuhan University, Harbin Institute of Technology, City University of Hong Kong, and National Tsinghua University
How to aggregate spatial-temporal information plays an essential role in video super-resolution (VSR) tasks. Despite the remarkable success, existing methods adopt static convolution to encode spatial-temporal information, which lacks flexibility in aggregating information in large-scale remote sensing scenes, as they often contain heterogeneous features (e.g., diverse textures). In this paper, we propose a spatial feature diversity enhancement module (SDE) and channel diversity exploration module (CDE), which explore the diverse representation of different local patterns while aggregating the global response with compact channel-wise embedding representation. Specifically, SDE introduces multiple learnable filters to extract representative spatial variants and encodes them to generate a dynamic kernel for enriched spatial representation. To explore the diversity in the channel dimension, CDE exploits the discrete cosine transform to transform the feature into the frequency domain. This enriches the channel representation while mitigating massive frequency loss caused by pooling operation. Based on SDE and CDE, we further devise a multi-axis feature diversity enhancement (MADE) module to harmonize the spatial, channel, and pixel-wise features for diverse feature fusion. These elaborate strategies form a novel network for satellite VSR, termed MADNet, which achieves favorable performance against state-of-the-art method BasicVSR++ in terms of average PSNR by 0.14 dB on various video satellites, including JiLin-1, Carbonite-2, SkySat-1, and UrtheCast.
git clone https://github.com/XY-boy/MADNet.git
- CUDA 11.1
- pytorch 1.9.1
- torchvision 0.10.1
- build BasicSR
Please refer to MSDTGP to prepare the satellite video dataset Jilin-189.
python basicsr/train.py -opt options/train/MADNet/train_MADNet_JiLin.yml
python basicsr/train.py -opt options/test/MADNet/test_MADNet.yml
Our MADNet mainly borrows from IconVSR, and FcaNet. Thanks for these excellent open-source works!
If you have any questions or suggestions, feel free to contact me.
Email: xiao_yi@whu.edu.cn; xy574475@gmail.com
If you find our work helpful in your research, please consider citing it. We appreciate your support!😊
@ARTICLE{xiao2025madnet,
author={Xiao, Yi and Yuan, Qiangqiang and Jiang, Kui and Chen, Yuzeng and, Wang Shiqi and Lin, Chia-Wen},
journal={IEEE Transactions on Image Processing},
title={Multi-Axis Feature Diversity Enhancement for Remote Sensing Video Super-Resolution},
year={2025},
volume={34},
number={},
pages={1766-1778},
doi={10.1109/TIP.2025.3547298}
}

