CroBIM-U: Uncertainty-Driven Referring Remote Sensing Image Segmentation
Yuzhe Sun, Zhe Dong, Haochen Jiang, Tianzhu Liu, and Yanfeng Gu

TL;DR
This paper introduces an uncertainty-guided framework for referring remote sensing image segmentation, improving localization accuracy by adaptively handling ambiguous regions using a pixel-wise uncertainty map.
Contribution
It proposes a novel uncertainty-guided approach with a Referring Uncertainty Scorer and adaptive modules for fusion and refinement, enhancing segmentation robustness in complex scenes.
Findings
Significant improvement in segmentation accuracy on remote sensing datasets.
Enhanced robustness and geometric fidelity in complex imagery.
Effective integration without altering backbone architecture.
Abstract
Referring remote sensing image segmentation aims to localize specific targets described by natural language within complex overhead imagery. However, due to extreme scale variations, dense similar distractors, and intricate boundary structures, the reliability of cross-modal alignment exhibits significant \textbf{spatial non-uniformity}. Existing methods typically employ uniform fusion and refinement strategies across the entire image, which often introduces unnecessary linguistic perturbations in visually clear regions while failing to provide sufficient disambiguation in confused areas. To address this, we propose an \textbf{uncertainty-guided framework} that explicitly leverages a pixel-wise \textbf{referring uncertainty map} as a spatial prior to orchestrate adaptive inference. Specifically, we introduce a plug-and-play \textbf{Referring Uncertainty Scorer (RUS)}, which is trained…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Advanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis
