Rethinking the Value of Labels for Improving Class-Imbalanced Learning
Yuzhe Yang, Zhi Xu

TL;DR
This paper investigates the complex role of labels in class-imbalanced learning, revealing that while labels can be beneficial when combined with unlabeled data, self-supervised pretraining often outperforms label-based methods, challenging traditional assumptions.
Contribution
The study provides a theoretical and empirical analysis showing that imbalanced labels can be both beneficial and detrimental, depending on the learning approach, and proposes strategies to leverage unlabeled data effectively.
Findings
Imbalanced labels help reduce bias when combined with unlabeled data.
Self-supervised pretraining outperforms supervised methods on imbalanced datasets.
Theoretical insights guide improved semi-supervised and self-supervised learning strategies.
Abstract
Real-world data often exhibits long-tailed distributions with heavy class imbalance, posing great challenges for deep recognition models. We identify a persisting dilemma on the value of labels in the context of imbalanced learning: on the one hand, supervision from labels typically leads to better results than its unsupervised counterparts; on the other hand, heavily imbalanced data naturally incurs "label bias" in the classifier, where the decision boundary can be drastically altered by the majority classes. In this work, we systematically investigate these two facets of labels. We demonstrate, theoretically and empirically, that class-imbalanced learning can significantly benefit in both semi-supervised and self-supervised manners. Specifically, we confirm that (1) positively, imbalanced labels are valuable: given more unlabeled data, the original labels can be leveraged with the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsImbalanced Data Classification Techniques
