Feature-Fused SSD: Fast Detection for Small Objects

Guimei Cao; Xuemei Xie; Wenzhe Yang; Quan Liao; Guangming Shi and; Jinjian Wu

arXiv:1709.05054·cs.CV·November 28, 2018

Feature-Fused SSD: Fast Detection for Small Objects

Guimei Cao, Xuemei Xie, Wenzhe Yang, Quan Liao, Guangming Shi and, Jinjian Wu

PDF

1 Repo

TL;DR

This paper introduces Feature-Fused SSD, a real-time small object detection method that enhances the baseline SSD with multi-level feature fusion, achieving higher accuracy and faster speeds than existing methods.

Contribution

It proposes a novel multi-level feature fusion approach with two fusion modules to improve small object detection accuracy without sacrificing speed.

Findings

01

Higher mAP on PASCALVOC2007 compared to baseline SSD

02

Achieves 40-43 FPS, faster than state-of-the-art DSSD

03

Improves small object detection accuracy by 2-3 points

Abstract

Small objects detection is a challenging task in computer vision due to its limited resolution and information. In order to solve this problem, the majority of existing methods sacrifice speed for improvement in accuracy. In this paper, we aim to detect small objects at a fast speed, using the best object detector Single Shot Multibox Detector (SSD) with respect to accuracy-vs-speed trade-off as base architecture. We propose a multi-level feature fusion method for introducing contextual information in SSD, in order to improve the accuracy for small objects. In detailed fusion operation, we design two feature fusion modules, concatenation module and element-sum module, different in the way of adding contextual information. Experimental results show that these two fusion modules obtain higher mAP on PASCALVOC2007 than baseline SSD by 1.6 and 1.7 points respectively, especially with 2-3…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

wnzhyee/Feature-Fused-SSD
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Convolution · Non Maximum Suppression · 1x1 Convolution · SSD