Decouple then Converge: Handling Unknown Unlabeled Distributions in Long-Tailed Semi-Supervised Learning

Kai Gan; Tong Wei; Min-Ling Zhang

arXiv:2406.13187·cs.LG·May 19, 2026

Decouple then Converge: Handling Unknown Unlabeled Distributions in Long-Tailed Semi-Supervised Learning

Kai Gan, Tong Wei, Min-Ling Zhang

PDF

1 Repo

TL;DR

DeCon is a novel approach for long-tailed semi-supervised learning that decouples learning into two branches for head and tail classes, then gradually converges, achieving state-of-the-art results especially when labeled and unlabeled class distributions differ.

Contribution

The paper introduces DeCon, a simple yet effective method that handles unknown unlabeled class distributions by decoupling and converging two specialized learning branches.

Findings

01

DeCon achieves an average 2.7% accuracy increase over existing methods in mismatched class distributions.

02

DeCon outperforms many sophisticated LTSSL algorithms even when class distributions are identical.

03

Extensive ablation studies identify key factors contributing to DeCon's success.

Abstract

While long-tailed semi-supervised learning (LTSSL) has attracted growing attention in many real-world classification tasks, existing LTSSL algorithms typically assume that labeled and unlabeled data share nearly identical class distributions. When this assumption is violated, these methods can perform poorly because they rely on biased model-generated pseudo-labels. To address this issue, we propose a simple yet effective approach called DeCon for LTSSL with unknown unlabeled class distributions. Specifically, DeCon decouples learning into two specialized branches: a standard branch that focuses on head classes and a balanced branch that focuses on tail classes. During training, the two branches interact and gradually converge, allowing them to complement each other and ultimately achieve strong performance across all classes. Despite its simplicity, we show that DeCon achieves…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Gank0078/DeCon
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Gaussian Processes and Bayesian Inference · Machine Learning and ELM

MethodsSoftmax · Attention Is All You Need