Bridging the Scale Gap: Balanced Tiny and General Object Detection in Remote Sensing Imagery
Zhicheng Zhao, Yin Huang, Lingma Sun, Chenglong Li, Jin Tang

TL;DR
This paper introduces ScaleBridge-Det, a novel detection framework that effectively balances tiny and large object detection in remote sensing imagery by using scale-adaptive routing and density-guided query allocation.
Contribution
It presents the first large detection framework for tiny objects that achieves balanced performance across scales through innovative modules like REM and DGQ.
Findings
Achieves state-of-the-art results on AI-TOD-V2 and DTOD datasets.
Demonstrates superior cross-domain robustness on VisDrone.
Effectively balances detection of tiny and large objects without trade-offs.
Abstract
Tiny object detection in remote sensing imagery has attracted significant research interest in recent years. Despite recent progress, achieving balanced detection performance across diverse object scales remains a formidable challenge, particularly in scenarios where dense tiny objects and large objects coexist. Although large foundation models have revolutionized general vision tasks, their application to tiny object detection remains unexplored due to the extreme scale variation and density distribution inherent to remote sensing imagery. To bridge this scale gap, we propose ScaleBridge-Det, to the best of our knowledge, the first large detection framework designed for tiny objects, which could achieve balanced performance across diverse scales through scale-adaptive expert routing and density-guided query allocation. Specifically, we introduce a Routing-Enhanced Mixture Attention…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Remote-Sensing Image Classification · Advanced Image and Video Retrieval Techniques
