TL;DR
This paper introduces a text-guided remote sensing image transmission framework that significantly reduces data volume while maintaining high-quality reconstruction, leveraging cross-modal learning and textual summaries.
Contribution
It presents a novel system combining text generation and image restoration to enable efficient remote sensing image transmission and reconstruction.
Findings
Reduces data transmission to about 2% of original size.
Achieves PSNRs of 16.36 dB, 26.87 dB, and 27.41 dB on different datasets.
Demonstrates effective reconstruction with semantic coherence.
Abstract
High-resolution remote sensing imagery is critical for environmental monitoring, urban mapping, and land cover analysis, but its transmission is often hindered by limited bandwidth and high communication costs. Conventional pipelines transmit full-resolution pixel data, resulting in redundant and inefficient delivery. This paper proposes a text-guided remote sensing image transmission system that replaces complete high-resolution data with low-resolution images accompanied by compact textual descriptions. An onboard text generator produces spatial and semantic summaries, reducing the transmitted data volume to approximately 2\% of the original size. For ground-based reconstruction, a text-conditioned image restoration model is introduced, which leverages cross-modal learning to recover fine spatial details and maintain semantic coherence. Experimental results on the Alsat-2B, UC Merced…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
