Enhancement of SSD by concatenating feature maps for object detection
Jisoo Jeong, Hyojin Park, and Nojun Kwak

TL;DR
This paper enhances the SSD object detection method by effectively concatenating feature maps, leading to improved accuracy and speed, outperforming existing models like YOLO and Faster-RCNN on Pascal VOC datasets.
Contribution
The paper introduces a novel approach to utilize feature maps more effectively in SSD, improving accuracy without increasing model complexity significantly.
Findings
Achieved 78.5% mAP at 35 FPS on Pascal VOC 2007 test.
Achieved 80.8% mAP at 16.6 FPS with larger input size.
Outperformed state-of-the-art models like YOLO and Faster-RCNN.
Abstract
We propose an object detection method that improves the accuracy of the conventional SSD (Single Shot Multibox Detector), which is one of the top object detection algorithms in both aspects of accuracy and speed. The performance of a deep network is known to be improved as the number of feature maps increases. However, it is difficult to improve the performance by simply raising the number of feature maps. In this paper, we propose and analyze how to use feature maps effectively to improve the performance of the conventional SSD. The enhanced performance was obtained by changing the structure close to the classifier network, rather than growing layers close to the input data, e.g., by replacing VGGNet with ResNet. The proposed network is suitable for sharing the weights in the classifier networks, by which property, the training can be faster with better generalization power. For the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Advanced Data Storage Technologies
MethodsAverage Pooling · Global Average Pooling · *Communicated@Fast*How Do I Communicate to Expedia? · Batch Normalization · Bottleneck Residual Block · Max Pooling · Kaiming Initialization · Residual Connection · Convolution · Residual Block
