Humans disagree with the IoU for measuring object detector localization   error

Ombretta Strafforello; Vanathi Rajasekart; Osman S. Kayhan; Oana Inel,; Jan van Gemert

arXiv:2207.14221·cs.CV·July 29, 2022

Humans disagree with the IoU for measuring object detector localization error

Ombretta Strafforello, Vanathi Rajasekart, Osman S. Kayhan, Oana Inel,, Jan van Gemert

PDF

Open Access 1 Repo

TL;DR

This paper reveals that human perception of object localization errors differs from IoU scores, suggesting IoU alone may not fully capture human judgment of localization quality.

Contribution

It is the first study to compare human judgments with IoU scores for object detector localization errors, highlighting the limitations of IoU as an evaluation metric.

Findings

01

Humans do not consider equal IoU scores as equally acceptable.

02

Participants show preferences for certain localization errors over others with the same IoU.

03

IoU scores alone may not align with human perception of localization quality.

Abstract

The localization quality of automatic object detectors is typically evaluated by the Intersection over Union (IoU) score. In this work, we show that humans have a different view on localization quality. To evaluate this, we conduct a survey with more than 70 participants. Results show that for localization errors with the exact same IoU score, humans might not consider that these errors are equal, and express a preference. Our work is the first to evaluate IoU with humans and makes it clear that relying on IoU scores alone to evaluate localization errors might not be sufficient.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ombretta/humans_vs_iou
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVisual Attention and Saliency Detection · Advanced Neural Network Applications · Mobile Crowdsensing and Crowdsourcing