Seeing without Looking: Contextual Rescoring of Object Detections for AP   Maximization

Louren\c{c}o V. Pato; Renato Negrinho; Pedro M. Q. Aguiar

arXiv:1912.12290·cs.CV·March 31, 2020

Seeing without Looking: Contextual Rescoring of Object Detections for AP Maximization

Louren\c{c}o V. Pato, Renato Negrinho, Pedro M. Q. Aguiar

PDF

1 Repo 1 Video

TL;DR

This paper introduces a context-based post-processing method using a bidirectional RNN with attention to rescore object detection confidences, improving AP by leveraging detection relationships without visual features.

Contribution

It presents a novel, architecture-agnostic rescoring approach that enhances detection accuracy by modeling contextual information among detections, improving AP without additional visual feature computation.

Findings

01

Consistent AP improvements over baseline detectors.

02

Effective reduction of duplicate and out-of-context detections.

03

Method is computationally inexpensive and architecture-agnostic.

Abstract

The majority of current object detectors lack context: class predictions are made independently from other detections. We propose to incorporate context in object detection by post-processing the output of an arbitrary detector to rescore the confidences of its detections. Rescoring is done by conditioning on contextual information from the entire set of detections: their confidences, predicted classes, and positions. We show that AP can be improved by simply reassigning the detection confidence values such that true positives that survive longer (i.e., those with the correct class and large IoU) are scored higher than false positives or detections with small IoU. In this setting, we use a bidirectional RNN with attention for contextual rescoring and introduce a training target that uses the IoU with ground truth to maximize AP for the given set of detections. The fact that our approach…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

LourencoVazPato/seeing-without-looking
pytorchOfficial

Videos

Seeing without Looking: Contextual Rescoring of Object Detections for AP Maximization· youtube

Taxonomy

MethodsRegion Proposal Network · Softmax · Convolution · RoIPool · Faster R-CNN