Probabilistic two-stage detection
Xingyi Zhou, Vladlen Koltun, Philipp Kr\"ahenb\"uhl

TL;DR
This paper introduces a probabilistic framework for two-stage object detection, improving accuracy and speed by leveraging one-stage detectors for better likelihood estimation, achieving state-of-the-art results on COCO.
Contribution
It proposes a novel probabilistic interpretation of two-stage detection, enabling the construction of faster, more accurate detectors from existing one-stage models.
Findings
Achieves 56.4 mAP on COCO test-dev with single-scale testing.
Outperforms YOLOv4 at 33 fps using a lightweight backbone.
Provides a probabilistic rationale for training practices in detection.
Abstract
We develop a probabilistic interpretation of two-stage object detection. We show that this probabilistic interpretation motivates a number of common empirical training practices. It also suggests changes to two-stage detection pipelines. Specifically, the first stage should infer proper object-vs-background likelihoods, which should then inform the overall score of the detector. A standard region proposal network (RPN) cannot infer this likelihood sufficiently well, but many one-stage detectors can. We show how to build a probabilistic two-stage detector from any state-of-the-art one-stage detector. The resulting detectors are faster and more accurate than both their one- and two-stage precursors. Our detector achieves 56.4 mAP on COCO test-dev with single-scale testing, outperforming all published results. Using a lightweight backbone, our detector achieves 49.2 mAP on COCO at 33 fps…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning
Methods1x1 Convolution · Average Pooling · Feature Pyramid Network · Max Pooling · Convolution
