What makes for effective detection proposals?

Jan Hosang; Rodrigo Benenson; Piotr Doll\'ar; Bernt Schiele

arXiv:1502.05082·cs.CV·August 7, 2015

What makes for effective detection proposals?

Jan Hosang, Rodrigo Benenson, Piotr Doll\'ar, Bernt Schiele

PDF

TL;DR

This paper analyzes twelve object detection proposal methods, highlighting the importance of localization accuracy and recall, and introduces a new metric, AR, to better predict detection performance.

Contribution

It provides an in-depth comparison of proposal methods, introduces the AR metric, and offers insights for selecting and tuning detection proposals.

Findings

01

Localization accuracy is as crucial as recall for detection performance.

02

The proposed AR metric correlates well with detection success.

03

Different proposal methods have distinct strengths and weaknesses.

Abstract

Current top performing object detectors employ detection proposals to guide the search for objects, thereby avoiding exhaustive sliding window search across images. Despite the popularity and widespread use of detection proposals, it is unclear which trade-offs are made when using them during object detection. We provide an in-depth analysis of twelve proposal methods along with four baselines regarding proposal repeatability, ground truth annotation recall on PASCAL, ImageNet, and MS COCO, and their impact on DPM, R-CNN, and Fast R-CNN detection performance. Our analysis shows that for object detection improving proposal localisation accuracy is as important as improving recall. We introduce a novel metric, the average recall (AR), which rewards both high recall and good localisation and correlates surprisingly well with detection performance. Our findings show common strengths and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSoftmax · RoIPool · Fast R-CNN · Support Vector Machine · Max Pooling · Convolution · R-CNN