Exploring Fine-Grained Image-Text Alignment for Referring Remote Sensing Image Segmentation
Sen Lei, Xinyu Xiao, Tianlin Zhang, Heng-Chao Li, Zhenwei Shi, Qing, Zhu

TL;DR
This paper introduces a fine-grained image-text alignment approach for remote sensing image segmentation, decoupling expressions into object and spatial components, and employing multi-scale fusion to improve discriminative multi-modal feature extraction.
Contribution
It proposes a novel fine-grained alignment module and a multi-scale enhancement module tailored for remote sensing image segmentation, advancing beyond coarse alignment methods.
Findings
Outperforms state-of-the-art on RefSegRS and RRSIS-D datasets.
Effectively captures discriminative multi-modal features.
Improves segmentation accuracy for various object scales.
Abstract
Given a language expression, referring remote sensing image segmentation (RRSIS) aims to identify ground objects and assign pixel-wise labels within the imagery. The one of key challenges for this task is to capture discriminative multi-modal features via text-image alignment. However, the existing RRSIS methods use one vanilla and coarse alignment, where the language expression is directly extracted to be fused with the visual features. In this paper, we argue that a ``fine-grained image-text alignment'' can improve the extraction of multi-modal information. To this point, we propose a new referring remote sensing image segmentation method to fully exploit the visual and linguistic representations. Specifically, the original referring expression is regarded as context text, which is further decoupled into the ground object and spatial position texts. The proposed fine-grained…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGeographic Information Systems Studies
