Data Quality Control in Federated Instruction-tuning of Large Language   Models

Yaxin Du; Rui Ye; Fengting Yuchi; Wanru Zhao; Jingjing Qu; and Yanfeng Wang; Siheng Chen

arXiv:2410.11540·cs.LG·March 3, 2025

Data Quality Control in Federated Instruction-tuning of Large Language Models

Yaxin Du, Rui Ye, Fengting Yuchi, Wanru Zhao, Jingjing Qu, and Yanfeng Wang, Siheng Chen

PDF

Open Access

TL;DR

This paper introduces FedDQC, a federated instruction tuning framework with dynamic data quality control that improves large language model performance by filtering and progressively training on high-quality data.

Contribution

The paper presents a novel framework with instruction-response alignment and hierarchical training for dynamic data quality management in federated learning of LLMs.

Findings

01

Significant performance improvements on mixed-quality data.

02

Effective client-side data quality evaluation metric.

03

Progressive training from high- to low-IRA data enhances learning.

Abstract

Federated Learning (FL) enables privacy-preserving collaborative instruction tuning of large language models (LLMs) by leveraging massively distributed data. However, the decentralized nature of FL exacerbates data quality challenges, as local clients lack global visibility to filter noisy or low-quality samples before training. To resolve this issue, we propose FedDQC, a novel federated instruction tuning framework with dynamic data quality control. Our approach introduces two key innovations. First, we propose instruction-response alignment (IRA), an efficient client-side metric for quality evaluation requiring only low-cost inference. We validate that higher-IRA data corresponds to more relevant and easier-to-learn question-answer pairs. Second, mirroring the human easy-to-hard knowledge acquisition process, we design a quality-aware hierarchical FL training framework, where the LLM…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Quality and Management · Recommender Systems and Techniques · Privacy-Preserving Technologies in Data