TL;DR
The paper introduces Localisation Recall Precision (LRP) Error, a new evaluation metric for visual detection tasks that addresses limitations of existing metrics like AP and PQ by incorporating localisation and classification quality, and is applicable across various detection tasks.
Contribution
It proposes LRP Error and Optimal LRP (oLRP) Error as new, robust metrics for evaluating and optimizing visual detection models across multiple tasks and datasets.
Findings
LRP Error provides richer, more discriminative evaluation than AP and PQ.
Empirical analysis on nearly 100 detectors across 7 tasks shows LRP Error's effectiveness.
LRP Error is applicable to all visual detection outputs, including those without confidence scores.
Abstract
Despite being widely used as a performance measure for visual detection tasks, Average Precision (AP) is limited in (i) reflecting localisation quality, (ii) interpretability and (iii) robustness to the design choices regarding its computation, and its applicability to outputs without confidence scores. Panoptic Quality (PQ), a measure proposed for evaluating panoptic segmentation (Kirillov et al., 2019), does not suffer from these limitations but is limited to panoptic segmentation. In this paper, we propose Localisation Recall Precision (LRP) Error as the average matching error of a visual detector computed based on both its localisation and classification qualities for a given confidence score threshold. LRP Error, initially proposed only for object detection by Oksuz et al. (2018), does not suffer from the aforementioned limitations and is applicable to all visual detection tasks.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsInterpretability
