SeG-SR: Integrating Semantic Knowledge into Remote Sensing Image Super-Resolution via Vision-Language Model
Bowen Chen, Keyan Chen, Mohan Yang, Zhengxia Zou, Zhenwei Shi

TL;DR
This paper introduces SeG-SR, a novel remote sensing image super-resolution framework that integrates high-level semantic knowledge from vision-language models to improve reconstruction quality and semantic consistency.
Contribution
SeG-SR is the first to incorporate semantic knowledge via vision-language models into remote sensing image super-resolution, enhancing performance and scene understanding.
Findings
Achieved state-of-the-art results on three datasets.
Significantly improved PSNR and SSIM scores.
Enhanced generalization across different SR architectures.
Abstract
High-resolution (HR) remote sensing imagery plays a vital role in a wide range of applications, including urban planning and environmental monitoring. However, due to limitations in sensors and data transmission links, the images acquired in practice often suffer from resolution degradation. Remote Sensing Image Super-Resolution (RSISR) aims to reconstruct HR images from low-resolution (LR) inputs, providing a cost-effective and efficient alternative to direct HR image acquisition. Existing RSISR methods primarily focus on low-level characteristics in pixel space, while neglecting the high-level understanding of remote sensing scenes. This may lead to semantically inconsistent artifacts in the reconstructed results. Motivated by this observation, our work aims to explore the role of high-level semantic knowledge in improving RSISR performance. We propose a Semantic-Guided…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsFocus
