Shift Equivariance in Object Detection
Marco Manfredi, Yu Wang

TL;DR
This paper introduces a new metric to evaluate shift equivariance in object detection models, revealing that current architectures are sensitive to small input shifts and that existing solutions do not fully address this issue.
Contribution
The paper proposes a novel evaluation metric for shift equivariance in object detection and assesses various methods, highlighting the persistent sensitivity of models to input shifts.
Findings
Modern object detectors are sensitive to even one pixel shifts.
Existing solutions do not achieve full shift equivariance.
The new metric effectively quantifies shift variance in detection models.
Abstract
Robustness to small image translations is a highly desirable property for object detectors. However, recent works have shown that CNN-based classifiers are not shift invariant. It is unclear to what extent this could impact object detection, mainly because of the architectural differences between the two and the dimensionality of the prediction space of modern detectors. To assess shift equivariance of object detection models end-to-end, in this paper we propose an evaluation metric, built upon a greedy search of the lower and upper bounds of the mean average precision on a shifted image set. Our new metric shows that modern object detection architectures, no matter if one-stage or two-stage, anchor-based or anchor-free, are sensitive to even one pixel shift to the input images. Furthermore, we investigate several possible solutions to this problem, both taken from the literature and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning
