TL;DR
This paper introduces SSPNet, a novel scale selection pyramid network designed to improve tiny person detection in UAV images by addressing scale variability and gradient inconsistency issues, achieving superior performance on benchmarks.
Contribution
The paper proposes SSPNet with three modules (CAM, SEM, SSM) and a WNS strategy, enhancing tiny object detection by better scale focus and gradient consistency.
Findings
Outperforms state-of-the-art detectors on TinyPerson benchmark
Effectively highlights scale-specific features for tiny objects
Improves detection accuracy in UAV imagery
Abstract
With the increasing demand for search and rescue, it is highly demanded to detect objects of interest in large-scale images captured by Unmanned Aerial Vehicles (UAVs), which is quite challenging due to extremely small scales of objects. Most existing methods employed Feature Pyramid Network (FPN) to enrich shallow layers' features by combing deep layers' contextual features. However, under the limitation of the inconsistency in gradient computation across different layers, the shallow layers in FPN are not fully exploited to detect tiny objects. In this paper, we propose a Scale Selection Pyramid network (SSPNet) for tiny person detection, which consists of three components: Context Attention Module (CAM), Scale Enhancement Module (SEM), and Scale Selection Module (SSM). CAM takes account of context information to produce hierarchical attention heatmaps. SEM highlights features of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Methods1x1 Convolution · Convolution · Class-activation map
