RobustFT: Robust Supervised Fine-tuning for Large Language Models under   Noisy Response

Junyu Luo; Xiao Luo; Kaize Ding; Jingyang Yuan; Zhiping Xiao; Ming; Zhang

arXiv:2412.14922·cs.CL·December 20, 2024

RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response

Junyu Luo, Xiao Luo, Kaize Ding, Jingyang Yuan, Zhiping Xiao, Ming, Zhang

PDF

Open Access 1 Repo 1 Datasets

TL;DR

RobustFT is a novel framework that improves supervised fine-tuning of large language models by effectively detecting and relabeling noisy data, leading to enhanced performance in noisy environments.

Contribution

The paper introduces RobustFT, a noise-robust fine-tuning framework that combines multi-expert noise detection, context-aware relabeling, and entropy-based data selection for large language models.

Findings

01

RobustFT significantly outperforms baseline methods in noisy data scenarios.

02

The multi-expert noise detection system improves noise identification accuracy.

03

Entropy-based sample selection enhances the quality of fine-tuning data.

Abstract

Supervised fine-tuning (SFT) plays a crucial role in adapting large language models (LLMs) to specific domains or tasks. However, as demonstrated by empirical experiments, the collected data inevitably contains noise in practical applications, which poses significant challenges to model performance on downstream tasks. Therefore, there is an urgent need for a noise-robust SFT framework to enhance model capabilities in downstream tasks. To address this challenge, we introduce a robust SFT framework (RobustFT) that performs noise detection and relabeling on downstream task data. For noise identification, our approach employs a multi-expert collaborative system with inference-enhanced models to achieve superior noise detection. In the denoising phase, we utilize a context-enhanced strategy, which incorporates the most relevant and confident knowledge followed by careful assessment to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

luo-junyu/robustft
noneOfficial

Datasets

luojunyu/RobustFT
dataset· 85 dl
85 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Topic Modeling · Natural Language Processing Techniques

MethodsShrink and Fine-Tune