Learning Detection with Diverse Proposals
Samaneh Azadi, Jiashi Feng, and Trevor Darrell

TL;DR
This paper introduces a differentiable DPP layer for object detection that enhances proposal diversity and contextual understanding, leading to more accurate and semantically aware detections without increasing model complexity.
Contribution
The novel trainable DPP layer integrates into detection architectures to model proposal diversity and relations, improving detection accuracy over traditional methods like NMS.
Findings
LDDP improves localization and classification accuracy.
LDDP maintains superiority with fewer proposals (~30%).
Enhances contextual and semantic relations between proposals.
Abstract
To predict a set of diverse and informative proposals with enriched representations, this paper introduces a differentiable Determinantal Point Process (DPP) layer that is able to augment the object detection architectures. Most modern object detection architectures, such as Faster R-CNN, learn to localize objects by minimizing deviations from the ground-truth but ignore correlation between multiple proposals and object categories. Non-Maximum Suppression (NMS) as a widely used proposal pruning scheme ignores label- and instance-level relations between object candidates resulting in multi-labeled detections. In the multi-class case, NMS selects boxes with the largest prediction scores ignoring the semantic relation between categories of potential election. In contrast, our trainable DPP layer, allowing for Learning Detection with Diverse Proposals (LDDP), considers both label-level…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Video Surveillance and Tracking Methods
MethodsPruning · Region Proposal Network · Softmax · Convolution · RoIPool · Faster R-CNN
