DuBox: No-Prior Box Objection Detection via Residual Dual Scale Detectors
Shuai Chen, Jinpeng Li, Chuanqi Yao, Wenbo Hou, Shuo Qin, Wenyao Jin,, Xu Tang

TL;DR
Dubox introduces a no-prior box, dual-scale residual detection method that improves speed and accuracy in object detection by reducing computational redundancy and enhancing small object detection.
Contribution
It proposes a novel one-stage detection approach that eliminates the need for prior boxes using dual-scale residual units and a progressive loss function.
Findings
Achieves competitive accuracy on VOC and COCO benchmarks.
Reduces computational redundancy compared to traditional methods.
Enhances detection of small objects with dual-scale residual learning.
Abstract
Traditional neural objection detection methods use multi-scale features that allow multiple detectors to perform detecting tasks independently and in parallel. At the same time, with the handling of the prior box, the algorithm's ability to deal with scale invariance is enhanced. However, too many prior boxes and independent detectors will increase the computational redundancy of the detection algorithm. In this study, we introduce Dubox, a new one-stage approach that detects the objects without prior box. Working with multi-scale features, the designed dual scale residual unit makes dual scale detectors no longer run independently. The second scale detector learns the residual of the first. Dubox has enhanced the capacity of heuristic-guided that can further enable the first scale detector to maximize the detection of small targets and the second to detect objects that cannot be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
