Image-Level or Object-Level? A Tale of Two Resampling Strategies for   Long-Tailed Detection

Nadine Chang; Zhiding Yu; Yu-Xiong Wang; Anima Anandkumar; Sanja; Fidler; Jose M. Alvarez

arXiv:2104.05702·cs.CV·October 20, 2021·5 cites

Image-Level or Object-Level? A Tale of Two Resampling Strategies for Long-Tailed Detection

Nadine Chang, Zhiding Yu, Yu-Xiong Wang, Anima Anandkumar, Sanja, Fidler, Jose M. Alvarez

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a unified resampling strategy combining image-level and object-level approaches to improve long-tailed detection, utilizing an object-centric memory replay to enhance class balance and detection performance.

Contribution

It proposes a novel joint resampling strategy (RIO) that unifies image and object-level resampling for long-tailed detection, with an object-centric memory replay mechanism.

Findings

01

Outperforms state-of-the-art methods on LVIS v0.5 dataset

02

Effective object-level resampling with minimal extra computation

03

Implicit feature augmentation from model updates enhances detection

Abstract

Training on datasets with long-tailed distributions has been challenging for major recognition tasks such as classification and detection. To deal with this challenge, image resampling is typically introduced as a simple but effective approach. However, we observe that long-tailed detection differs from classification since multiple classes may be present in one image. As a result, image resampling alone is not enough to yield a sufficiently balanced distribution at the object level. We address object-level resampling by introducing an object-centric memory replay strategy based on dynamic, episodic memory banks. Our proposed strategy has two benefits: 1) convenient object-level resampling without significant extra computation, and 2) implicit feature-level augmentation from model updates. We show that image-level and object-level resamplings are both important, and thus unify them with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

NVlabs/RIO
pytorchOfficial

Videos

Image-Level or Object-Level? A Tale of Two Resampling Strategies for Long-Tailed Detection· slideslive

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Adversarial Robustness in Machine Learning