A Comparative Study of Pre-training and Self-training
Yiheng Wang, Jiayu Lin, Zuoquan Lin

TL;DR
This paper provides an extensive empirical comparison of pre-training and self-training in semi-supervised learning, revealing that pre-training with fine-tuning generally outperforms self-training across various datasets and settings.
Contribution
It introduces a comprehensive ensemble method to systematically compare all feasible training paradigms combining pre-training, self-training, and fine-tuning under consistent conditions.
Findings
Pre-training with fine-tuning achieves the best overall performance.
Self-training does not add benefits when combined with semi-supervised pre-training.
Experiments conducted on six datasets with various data augmentation and imbalance scenarios.
Abstract
Pre-training and self-training are two approaches to semi-supervised learning. The comparison between pre-training and self-training has been explored. However, the previous works led to confusing findings: self-training outperforms pre-training experienced on some tasks in computer vision, and contrarily, pre-training outperforms self-training experienced on some tasks in natural language processing, under certain conditions of incomparable settings. We propose, comparatively and exhaustively, an ensemble method to empirical study all feasible training paradigms combining pre-training, self-training, and fine-tuning within consistent foundational settings comparable to data augmentation. We conduct experiments on six datasets, four data augmentation, and imbalanced data for sentiment analysis and natural language inference tasks. Our findings confirm that the pre-training and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Resource Development and Performance Evaluation
