RIS-LAD: A Benchmark and Model for Referring Low-Altitude Drone Image Segmentation
Kai Ye, YingShi Luan, Zhudi Chen, Guangyue Meng, Pingyang Dai, Liujuan Cao

TL;DR
This paper introduces RIS-LAD, a new benchmark dataset for referring image segmentation in low-altitude drone scenarios, along with a novel model SAARN that improves segmentation by semantic-aware reasoning.
Contribution
It provides the first fine-grained RIS benchmark for LAD scenes and proposes SAARN, a model that enhances segmentation by decomposing and routing semantic information adaptively.
Findings
RIS-LAD presents new challenges for RIS algorithms.
SAARN outperforms existing methods on RIS-LAD.
The dataset and model advance vision-language understanding in drone imagery.
Abstract
Referring Image Segmentation (RIS), which aims to segment specific objects based on natural language descriptions, plays an essential role in vision-language understanding. Despite its progress in remote sensing applications, RIS in Low-Altitude Drone (LAD) scenarios remains underexplored. Existing datasets and methods are typically designed for high-altitude and static-view imagery. They struggle to handle the unique characteristics of LAD views, such as diverse viewpoints and high object density. To fill this gap, we present RIS-LAD, the first fine-grained RIS benchmark tailored for LAD scenarios. This dataset comprises 13,871 carefully annotated image-text-mask triplets collected from realistic drone footage, with a focus on small, cluttered, and multi-viewpoint scenes. It highlights new challenges absent in previous benchmarks, such as category drift caused by tiny objects and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · UAV Applications and Optimization · Infrared Target Detection Methodologies
