Enhanced Long-Tailed Recognition with Contrastive CutMix Augmentation
Haolin Pan, Yong Guo, Mianjie Yu, Jian Chen

TL;DR
This paper introduces ConCutMix, a novel data augmentation method that uses contrastive learning to generate semantically consistent labels for long-tailed recognition, significantly improving tail class accuracy.
Contribution
The paper proposes a contrastive learning-based label rectification for CutMix augmentation, enhancing long-tailed recognition performance.
Findings
3.0% overall accuracy improvement on ImageNet-LT
3.3% accuracy increase on tail classes
Effective generalization across benchmarks and models
Abstract
Real-world data often follows a long-tailed distribution, where a few head classes occupy most of the data and a large number of tail classes only contain very limited samples. In practice, deep models often show poor generalization performance on tail classes due to the imbalanced distribution. To tackle this, data augmentation has become an effective way by synthesizing new samples for tail classes. Among them, one popular way is to use CutMix that explicitly mixups the images of tail classes and the others, while constructing the labels according to the ratio of areas cropped from two images. However, the area-based labels entirely ignore the inherent semantic information of the augmented samples, often leading to misleading training signals. To address this issue, we propose a Contrastive CutMix (ConCutMix) that constructs augmented samples with semantically consistent labels to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIndustrial Vision Systems and Defect Detection · Neural Networks and Applications · Face and Expression Recognition
MethodsCutMix
