A Comparative Study of Pre-training and Self-training

Yiheng Wang; Jiayu Lin; Zuoquan Lin

arXiv:2409.02751 · cs.CL · September 5, 2024

TL;DR

This paper provides an extensive empirical comparison of pre-training and self-training in semi-supervised learning, revealing that pre-training with fine-tuning generally outperforms self-training across various datasets and settings.

Contribution

It introduces a comprehensive ensemble method to systematically compare all feasible training paradigms combining pre-training, self-training, and fine-tuning under consistent conditions.

Findings

01 Pre-training with fine-tuning achieves the best overall performance.

02 Self-training does not add benefits when combined with semi-supervised pre-training.

03 Experiments were conducted on six datasets under various data augmentation and imbalance scenarios.

Abstract

Pre-training and self-training are two approaches to semi-supervised learning. Comparisons between them have been explored before, but previous work led to confusing findings: self-training outperforms pre-training on some computer vision tasks, whereas pre-training outperforms self-training on some natural language processing tasks, and these results were obtained under settings that are not directly comparable. We propose an ensemble method to study empirically, comparatively and exhaustively, all feasible training paradigms combining pre-training, self-training, and fine-tuning within consistent foundational settings comparable to data augmentation. We conduct experiments on six datasets, with four data augmentation methods and with imbalanced data, for sentiment analysis and natural language inference tasks. Our findings confirm that the pre-training and…
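For readers unfamiliar with the term, self-training can be illustrated with a generic pseudo-labeling loop. The following is a minimal sketch in Python and not the paper's implementation: the nearest-centroid classifier on one-dimensional data, the margin-based confidence score, the `threshold` and `rounds` parameters, and all function names are assumptions chosen for illustration.

```python
from statistics import mean


def fit_centroids(xs, ys):
    """Toy 'model': map each label to the mean of the points carrying it."""
    return {c: mean(x for x, y in zip(xs, ys) if y == c) for c in set(ys)}


def predict_with_confidence(centroids, x):
    """Return (nearest label, confidence in [0.5, 1]).

    Assumes at least two classes. The confidence is an illustrative
    margin score: 1 - d_best / (d_best + d_second).
    """
    ranked = sorted((abs(x - c), label) for label, c in centroids.items())
    (d_best, label), (d_second, _) = ranked[0], ranked[1]
    total = d_best + d_second
    confidence = 0.5 if total == 0 else 1 - d_best / total
    return label, confidence


def self_train(labeled_x, labeled_y, unlabeled_x, threshold=0.8, rounds=5):
    """Generic self-training loop with pseudo-labels (hypothetical helper).

    Each round: fit on the current labeled set, pseudo-label the unlabeled
    points the model is confident about, move them into the labeled set,
    and repeat until nothing new is added or the round budget runs out.
    """
    xs, ys = list(labeled_x), list(labeled_y)
    pool = list(unlabeled_x)
    for _ in range(rounds):
        centroids = fit_centroids(xs, ys)
        remaining, added = [], False
        for x in pool:
            label, confidence = predict_with_confidence(centroids, x)
            if confidence >= threshold:
                xs.append(x)          # accept the pseudo-label
                ys.append(label)
                added = True
            else:
                remaining.append(x)   # stay unlabeled for the next round
        pool = remaining
        if not added:
            break
    return fit_centroids(xs, ys), pool
```

Calling `self_train([0.0, 10.0], [0, 1], [1.0, 9.0, 5.0])` returns the final centroids together with the points that never crossed the confidence threshold. Pre-training followed by fine-tuning differs in that the unlabeled data is used to learn representations first, rather than to generate pseudo-labels for the classifier itself.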

Code & Models

Repositories

PKUAI-LINGroup/PAS (PyTorch, official)
