TL;DR
DeCon is a novel approach for long-tailed semi-supervised learning that decouples learning into two branches for head and tail classes, then gradually converges, achieving state-of-the-art results especially when labeled and unlabeled class distributions differ.
Contribution
The paper introduces DeCon, a simple yet effective method that handles unknown unlabeled class distributions by decoupling and converging two specialized learning branches.
Findings
DeCon achieves an average 2.7% accuracy increase over existing methods in mismatched class distributions.
DeCon outperforms many sophisticated LTSSL algorithms even when class distributions are identical.
Extensive ablation studies identify key factors contributing to DeCon's success.
Abstract
While long-tailed semi-supervised learning (LTSSL) has attracted growing attention in many real-world classification tasks, existing LTSSL algorithms typically assume that labeled and unlabeled data share nearly identical class distributions. When this assumption is violated, these methods can perform poorly because they rely on biased model-generated pseudo-labels. To address this issue, we propose a simple yet effective approach called DeCon for LTSSL with unknown unlabeled class distributions. Specifically, DeCon decouples learning into two specialized branches: a standard branch that focuses on head classes and a balanced branch that focuses on tail classes. During training, the two branches interact and gradually converge, allowing them to complement each other and ultimately achieve strong performance across all classes. Despite its simplicity, we show that DeCon achieves…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Gaussian Processes and Bayesian Inference · Machine Learning and ELM
MethodsSoftmax · Attention Is All You Need
