SCRDet: Towards More Robust Detection for Small, Cluttered and Rotated Objects
Xue Yang, Jirui Yang, Junchi Yan, Yue Zhang, Tengfei Zhang, Zhi Guo,, Sun Xian, Kun Fu

TL;DR
SCRDet is a novel multi-category rotation detector designed to improve detection of small, cluttered, and rotated objects, especially in aerial images, by fusing multi-layer features and employing attention mechanisms for enhanced sensitivity and accuracy.
Contribution
The paper introduces SCRDet, a new detector combining multi-layer feature fusion, attention mechanisms, and an IoU-based loss for better detection of challenging objects in various datasets.
Findings
Achieves state-of-the-art results on remote sensing datasets DOTA and NWPU VHR-10.
Outperforms existing methods on natural image datasets COCO and VOC2007.
Demonstrates robustness in detecting small, cluttered, and rotated objects.
Abstract
Object detection has been a building block in computer vision. Though considerable progress has been made, there still exist challenges for objects with small size, arbitrary direction, and dense distribution. Apart from natural images, such issues are especially pronounced for aerial images of great importance. This paper presents a novel multi-category rotation detector for small, cluttered and rotated objects, namely SCRDet. Specifically, a sampling fusion network is devised which fuses multi-layer feature with effective anchor sampling, to improve the sensitivity to small objects. Meanwhile, the supervised pixel attention network and the channel attention network are jointly explored for small and cluttered object detection by suppressing the noise and highlighting the objects feature. For more accurate rotation estimation, the IoU constant factor is added to the smooth L1 loss to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Image and Object Detection Techniques
