VFM-ISRefiner: Towards Better Adapting Vision Foundation Models for Interactive Segmentation of Remote Sensing Images
Deliang Wang, Peng Liu, Yan Ma, Rongkai Zhuang, Lajiao Chen, Bing Li, Yi Zeng

TL;DR
This paper introduces RS-ISRefiner, a click-based interactive segmentation framework tailored for remote sensing images, employing an adapter-based strategy and hybrid attention to improve accuracy and efficiency over existing methods.
Contribution
Proposes a novel remote sensing-specific IIS framework with an adapter-based tuning strategy and hybrid attention mechanism, enhancing segmentation accuracy and robustness.
Findings
Outperforms state-of-the-art IIS methods on six remote sensing datasets.
Achieves higher boundary accuracy and efficiency in interactive segmentation.
Demonstrates strong generalizability across diverse remote sensing scenarios.
Abstract
Interactive image segmentation(IIS) plays a critical role in generating precise annotations for remote sensing imagery, where objects often exhibit scale variations, irregular boundaries and complex backgrounds. However, existing IIS methods, primarily designed for natural images, struggle to generalize to remote sensing domains due to limited annotated data and computational overhead. To address these challenges, we proposed RS-ISRefiner, a novel click-based IIS framework tailored for remote sensing images. The framework employs an adapter-based tuning strategy that preserves the general representations of Vision Foundation Models while enabling efficient learning of remote sensing-specific spatial and boundary characteristics. A hybrid attention mechanism integrating convolutional local modeling with Transformer-based global reasoning enhances robustness against scale diversity and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Remote-Sensing Image Classification · Visual Attention and Saliency Detection
