LayerMatch: Do Pseudo-labels Benefit All Layers?
Chaoqi Liang, Guanglei Yang, Lifeng Qiao, Zitong Huang, Hongliang Yan,, Yunchao Wei, Wangmeng Zuo

TL;DR
LayerMatch introduces layer-specific pseudo-label strategies in semi-supervised learning, improving performance by mitigating noise effects and accelerating feature clustering, validated by extensive experiments.
Contribution
The paper proposes two novel layer-specific pseudo-label strategies, Grad-ReLU and Avg-Clustering, to enhance semi-supervised learning performance.
Findings
Achieves 10.38% improvement over baseline
Increases accuracy by 2.44% over state-of-the-art
Effectively mitigates noisy pseudo-label interference
Abstract
Deep neural networks have achieved remarkable performance across various tasks when supplied with large-scale labeled data. However, the collection of labeled data can be time-consuming and labor-intensive. Semi-supervised learning (SSL), particularly through pseudo-labeling algorithms that iteratively assign pseudo-labels for self-training, offers a promising solution to mitigate the dependency of labeled data. Previous research generally applies a uniform pseudo-labeling strategy across all model layers, assuming that pseudo-labels exert uniform influence throughout. Contrasting this, our theoretical analysis and empirical experiment demonstrate feature extraction layer and linear classification layer have distinct learning behaviors in response to pseudo-labels. Based on these insights, we develop two layer-specific pseudo-label strategies, termed Grad-ReLU and Avg-Clustering.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and dialogue systems · Semantic Web and Ontologies · Natural Language Processing Techniques
