Context-Aware Single-Shot Detector
Wei Xiang, Dong-Qing Zhang, Heather Yu, Vassilis Athitsos

TL;DR
This paper introduces CSSD, an enhancement of SSD that incorporates multi-scale context modeling through dilated and deconvolution layers, significantly improving small object detection accuracy while maintaining real-time performance.
Contribution
The paper proposes CSSD, a novel context-aware extension of SSD, with two variants using dilated and deconvolution layers, improving detection accuracy especially for small objects.
Findings
CSSD achieves +3.2% AP for small objects on MS-COCO.
Multi-scale context modeling improves overall detection accuracy.
CSSD maintains comparable runtime to original SSD.
Abstract
SSD is one of the state-of-the-art object detection algorithms, and it combines high detection accuracy with real-time speed. However, it is widely recognized that SSD is less accurate in detecting small objects compared to large objects, because it ignores the context from outside the proposal boxes. In this paper, we present CSSD--a shorthand for context-aware single-shot multibox object detector. CSSD is built on top of SSD, with additional layers modeling multi-scale contexts. We describe two variants of CSSD, which differ in their context layers, using dilated convolution layers (DiCSSD) and deconvolution layers (DeCSSD) respectively. The experimental results show that the multi-scale context modeling significantly improves the detection accuracy. In addition, we study the relationship between effective receptive fields (ERFs) and the theoretical receptive fields (TRFs),…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques
MethodsNon Maximum Suppression · 1x1 Convolution · SSD · Dilated Convolution · Convolution
