Query-based Hard-Image Retrieval for Object Detection at Test Time
Edward Ayers, Jonathan Sadeghi, John Redford, Romain Mueller, Puneet, K. Dokania

TL;DR
This paper introduces a post-hoc, query-based method for identifying hard images in object detection tasks, enabling fine-grained failure analysis without requiring ground-truth labels or retraining of detectors.
Contribution
It proposes a novel, detector-agnostic approach to retrieve hard images based on customizable queries, using Monte Carlo estimation without ground-truth annotations.
Findings
Successfully applied to multiple detectors like RetinaNet and Faster-RCNN
Effectively ranks and classifies hard images for various query types
Operates without labeled data or detector retraining
Abstract
There is a longstanding interest in capturing the error behaviour of object detectors by finding images where their performance is likely to be unsatisfactory. In real-world applications such as autonomous driving, it is also crucial to characterise potential failures beyond simple requirements of detection performance. For example, a missed detection of a pedestrian close to an ego vehicle will generally require closer inspection than a missed detection of a car in the distance. The problem of predicting such potential failures at test time has largely been overlooked in the literature and conventional approaches based on detection uncertainty fall short in that they are agnostic to such fine-grained characterisation of errors. In this work, we propose to reformulate the problem of finding "hard" images as a query-based hard image retrieval task, where queries are specific definitions…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques · Medical Image Segmentation Techniques
MethodsTest · 1x1 Convolution · Convolution · Feature Pyramid Network · Focal Loss · RetinaNet
