SSPNet: Scale Selection Pyramid Network for Tiny Person Detection from UAV Images

Mingbo Hong; Shuiwang Li; Yuchao Yang; Feiyu Zhu; Qijun Zhao; Li Lu

arXiv:2107.01548 · cs.CV · February 16, 2022

TL;DR

This paper introduces SSPNet, a scale selection pyramid network for tiny person detection in UAV images. It targets the scale variability of tiny objects and the gradient inconsistency across pyramid layers, and it outperforms state-of-the-art detectors on the TinyPerson benchmark.

Contribution

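The paper proposes SSPNet, which combines three modules, the Context Attention Module (CAM), the Scale Enhancement Module (SEM), and the Scale Selection Module (SSM), with a WNS strategy to improve tiny object detection through sharper scale focus and more consistent gradients across layers.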

Findings

01 Outperforms state-of-the-art detectors on the TinyPerson benchmark

02 Effectively highlights scale-specific features for tiny objects

03 Improves detection accuracy in UAV imagery

Abstract

With the growing demand for search and rescue, there is a strong need to detect objects of interest in large-scale images captured by Unmanned Aerial Vehicles (UAVs), a task made challenging by the extremely small scale of the objects. Most existing methods employ a Feature Pyramid Network (FPN) to enrich shallow layers' features by combining them with deep layers' contextual features. However, because gradient computation is inconsistent across layers, the shallow layers in FPN are not fully exploited for detecting tiny objects. In this paper, we propose a Scale Selection Pyramid Network (SSPNet) for tiny person detection, which consists of three components: Context Attention Module (CAM), Scale Enhancement Module (SEM), and Scale Selection Module (SSM). CAM takes context information into account to produce hierarchical attention heatmaps. SEM highlights features of…
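The FPN fusion that the abstract refers to, where deep-layer context is merged into shallow layers, can be illustrated with a minimal sketch. This is not the SSPNet method and not the authors' code: it shows only the generic top-down pathway on single-channel maps stored as nested lists, assuming each level is exactly half the resolution of the one before it. The names `upsample_nearest_2x`, `add_maps`, and `fpn_top_down` are hypothetical, and the 1x1 lateral and 3x3 smoothing convolutions of a real FPN are omitted.

```python
from typing import List

Grid = List[List[float]]  # single-channel feature map: rows x cols


def upsample_nearest_2x(fmap: Grid) -> Grid:
    """Double height and width by repeating each value (nearest neighbor)."""
    out: Grid = []
    for row in fmap:
        wide = [v for v in row for _ in range(2)]  # repeat each column
        out.append(wide)
        out.append(list(wide))  # repeat the row
    return out


def add_maps(a: Grid, b: Grid) -> Grid:
    """Element-wise sum of two maps with identical shapes."""
    if len(a) != len(b) or any(len(ra) != len(rb) for ra, rb in zip(a, b)):
        raise ValueError("feature maps must have the same shape")
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def fpn_top_down(laterals: List[Grid]) -> List[Grid]:
    """Generic FPN top-down merge (hypothetical helper, not SSPNet).

    `laterals` is ordered shallow (high resolution) to deep (low resolution).
    Each shallower level receives the upsampled merged map of the level below.
    """
    merged: List[Grid] = [[] for _ in laterals]
    merged[-1] = laterals[-1]  # the deepest level is passed through unchanged
    for i in range(len(laterals) - 2, -1, -1):
        merged[i] = add_maps(laterals[i], upsample_nearest_2x(merged[i + 1]))
    return merged


# Toy pyramid: 4x4 shallow map, 2x2 middle map, 1x1 deep map.
shallow = [[1.0] * 4 for _ in range(4)]
middle = [[1.0, 2.0], [3.0, 4.0]]
deep = [[10.0]]
pyramid = fpn_top_down([shallow, middle, deep])
print(pyramid[0])
```

In this toy pyramid the deep value 10.0 reaches every cell of the shallow map, which is the sense in which shallow features are enriched with deep context. Because the shallow map simply sums signals from all deeper levels, it offers no control over which scale each layer should respond to.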

Taxonomy

Methods: 1x1 Convolution · Convolution · Class-activation map