Enhanced Long-Tailed Recognition with Contrastive CutMix Augmentation

Haolin Pan; Yong Guo; Mianjie Yu; Jian Chen

arXiv:2407.04911·cs.CV·July 9, 2024·1 cites

Enhanced Long-Tailed Recognition with Contrastive CutMix Augmentation

Haolin Pan, Yong Guo, Mianjie Yu, Jian Chen

PDF

Open Access 2 Repos

TL;DR

This paper introduces ConCutMix, a novel data augmentation method that uses contrastive learning to generate semantically consistent labels for long-tailed recognition, significantly improving tail class accuracy.

Contribution

The paper proposes a contrastive learning-based label rectification for CutMix augmentation, enhancing long-tailed recognition performance.

Findings

01

3.0% overall accuracy improvement on ImageNet-LT

02

3.3% accuracy increase on tail classes

03

Effective generalization across benchmarks and models

Abstract

Real-world data often follows a long-tailed distribution, where a few head classes occupy most of the data and a large number of tail classes only contain very limited samples. In practice, deep models often show poor generalization performance on tail classes due to the imbalanced distribution. To tackle this, data augmentation has become an effective way by synthesizing new samples for tail classes. Among them, one popular way is to use CutMix that explicitly mixups the images of tail classes and the others, while constructing the labels according to the ratio of areas cropped from two images. However, the area-based labels entirely ignore the inherent semantic information of the augmented samples, often leading to misleading training signals. To address this issue, we propose a Contrastive CutMix (ConCutMix) that constructs augmented samples with semantically consistent labels to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIndustrial Vision Systems and Defect Detection · Neural Networks and Applications · Face and Expression Recognition

MethodsCutMix