TL;DR
This paper introduces Feature-Fused SSD, a real-time small object detection method that enhances the baseline SSD with multi-level feature fusion, achieving higher accuracy and faster speeds than existing methods.
Contribution
It proposes a novel multi-level feature fusion approach with two fusion modules to improve small object detection accuracy without sacrificing speed.
Findings
Higher mAP on PASCALVOC2007 compared to baseline SSD
Achieves 40-43 FPS, faster than state-of-the-art DSSD
Improves small object detection accuracy by 2-3 points
Abstract
Small objects detection is a challenging task in computer vision due to its limited resolution and information. In order to solve this problem, the majority of existing methods sacrifice speed for improvement in accuracy. In this paper, we aim to detect small objects at a fast speed, using the best object detector Single Shot Multibox Detector (SSD) with respect to accuracy-vs-speed trade-off as base architecture. We propose a multi-level feature fusion method for introducing contextual information in SSD, in order to improve the accuracy for small objects. In detailed fusion operation, we design two feature fusion modules, concatenation module and element-sum module, different in the way of adding contextual information. Experimental results show that these two fusion modules obtain higher mAP on PASCALVOC2007 than baseline SSD by 1.6 and 1.7 points respectively, especially with 2-3…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Convolution · Non Maximum Suppression · 1x1 Convolution · SSD
