Incremental Self-training for Semi-supervised Learning

Jifeng Guo; Zhulin Liu; Tong Zhang; C. L. Philip Chen

arXiv:2404.12398·cs.LG·April 22, 2024·5 cites

Incremental Self-training for Semi-supervised Learning

Jifeng Guo, Zhulin Liu, Tong Zhang, C. L. Philip Chen

PDF

Open Access

TL;DR

This paper introduces Incremental Self-training (IST), a semi-supervised learning method that processes unlabeled data in batches with high certainty pseudo-labels, improving accuracy and speed over traditional self-training.

Contribution

The paper proposes IST, a novel batch-wise, priority-based self-training approach that reduces time consumption and enhances performance in semi-supervised learning.

Findings

01

Outperforms state-of-the-art on three image classification tasks.

02

Improves recognition accuracy across five datasets.

03

Reduces training time compared to traditional self-training.

Abstract

Semi-supervised learning provides a solution to reduce the dependency of machine learning on labeled data. As one of the efficient semi-supervised techniques, self-training (ST) has received increasing attention. Several advancements have emerged to address challenges associated with noisy pseudo-labels. Previous works on self-training acknowledge the importance of unlabeled data but have not delved into their efficient utilization, nor have they paid attention to the problem of high time consumption caused by iterative learning. This paper proposes Incremental Self-training (IST) for semi-supervised learning to fill these gaps. Unlike ST, which processes all data indiscriminately, IST processes data in batches and priority assigns pseudo-labels to unlabeled samples with high certainty. Then, it processes the data around the decision boundary after the model is stabilized, enhancing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Speech Recognition and Synthesis · Machine Learning and ELM