DSOD: Learning Deeply Supervised Object Detectors from Scratch
Zhiqiang Shen, Zhuang Liu, Jianguo Li, Yu-Gang Jiang and, Yurong Chen, Xiangyang Xue

TL;DR
DSOD introduces a novel framework for training object detectors from scratch using deep supervision and dense connections, outperforming pre-trained models on standard benchmarks with fewer parameters.
Contribution
The paper proposes the DSOD framework that enables training object detectors from scratch, overcoming previous limitations and achieving state-of-the-art results with compact models.
Findings
DSOD outperforms SSD and Faster R-CNN on PASCAL VOC and MS COCO datasets.
DSOD achieves real-time detection speed with significantly fewer parameters.
Deep supervision via dense connections is crucial for training from scratch.
Abstract
We present Deeply Supervised Object Detector (DSOD), a framework that can learn object detectors from scratch. State-of-the-art object objectors rely heavily on the off-the-shelf networks pre-trained on large-scale classification datasets like ImageNet, which incurs learning bias due to the difference on both the loss functions and the category distributions between classification and detection tasks. Model fine-tuning for the detection task could alleviate this bias to some extent but not fundamentally. Besides, transferring pre-trained models from classification to detection between discrepant domains is even more difficult (e.g. RGB to depth images). A better solution to tackle these two critical problems is to train object detectors from scratch, which motivates our proposed DSOD. Previous efforts in this direction mostly failed due to much more complicated loss functions and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · COVID-19 diagnosis using AI
MethodsConvolution · Non Maximum Suppression · 1x1 Convolution · SSD
