Semi-supervised Fine-tuning for Large Language Models

Junyu Luo; Xiao Luo; Xiusi Chen; Zhiping Xiao; Wei Ju; Ming Zhang

arXiv:2410.14745·cs.CL·February 20, 2025·2 cites

Semi-supervised Fine-tuning for Large Language Models

Junyu Luo, Xiao Luo, Xiusi Chen, Zhiping Xiao, Wei Ju, Ming Zhang

PDF

Open Access 2 Repos 3 Models 1 Datasets

TL;DR

This paper introduces SemiEvol, a semi-supervised fine-tuning framework for large language models that effectively leverages both labeled and unlabeled data to improve domain-specific performance.

Contribution

The paper proposes SemiEvol, a novel semi-supervised fine-tuning approach with a propagate-and-select mechanism for better knowledge transfer in LLMs.

Findings

01

SemiEvol outperforms traditional SFT and self-evolution methods.

02

Significant performance improvements on seven datasets.

03

Effective in hybrid labeled and unlabeled data scenarios.

Abstract

Supervised fine-tuning (SFT) is crucial in adapting large language model (LLMs) to a specific domain or task. However, only a limited amount of labeled data is available in practical applications, which poses a severe challenge for SFT in yielding satisfactory results. Therefore, a data-efficient framework that can fully exploit labeled and unlabeled data for LLM fine-tuning is highly anticipated.Towards this end, we introduce a semi-supervised fine-tuning(SemiFT) task and a framework named SemiEvol for LLM alignment from a propagate-and-select manner. For knowledge propagation, SemiEvol adopts a bi-level approach, propagating knowledge from labeled data to unlabeled data through both in-weight and in-context methods. For knowledge selection, SemiEvol incorporates a collaborative learning mechanism, selecting higher-quality pseudo-response samples. We conducted experiments using…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

Datasets

luojunyu/SemiEvol
dataset· 1.6k dl
1.6k dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Machine Learning and Data Classification · Domain Adaptation and Few-Shot Learning

MethodsShrink and Fine-Tune