What makes for effective detection proposals?
Jan Hosang, Rodrigo Benenson, Piotr Doll\'ar, Bernt Schiele

TL;DR
This paper analyzes twelve object detection proposal methods, highlighting the importance of localization accuracy and recall, and introduces a new metric, AR, to better predict detection performance.
Contribution
It provides an in-depth comparison of proposal methods, introduces the AR metric, and offers insights for selecting and tuning detection proposals.
Findings
Localization accuracy is as crucial as recall for detection performance.
The proposed AR metric correlates well with detection success.
Different proposal methods have distinct strengths and weaknesses.
Abstract
Current top performing object detectors employ detection proposals to guide the search for objects, thereby avoiding exhaustive sliding window search across images. Despite the popularity and widespread use of detection proposals, it is unclear which trade-offs are made when using them during object detection. We provide an in-depth analysis of twelve proposal methods along with four baselines regarding proposal repeatability, ground truth annotation recall on PASCAL, ImageNet, and MS COCO, and their impact on DPM, R-CNN, and Fast R-CNN detection performance. Our analysis shows that for object detection improving proposal localisation accuracy is as important as improving recall. We introduce a novel metric, the average recall (AR), which rewards both high recall and good localisation and correlates surprisingly well with detection performance. Our findings show common strengths and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSoftmax · RoIPool · Fast R-CNN · Support Vector Machine · Max Pooling · Convolution · R-CNN
