Leveraging Unstructured Text Data for Federated Instruction Tuning of   Large Language Models

Rui Ye; Rui Ge; Yuchi Fengting; Jingyi Chai; Yanfeng Wang; Siheng Chen

arXiv:2409.07136·cs.CL·September 12, 2024

Leveraging Unstructured Text Data for Federated Instruction Tuning of Large Language Models

Rui Ye, Rui Ge, Yuchi Fengting, Jingyi Chai, Yanfeng Wang, Siheng Chen

PDF

Open Access

TL;DR

This paper introduces FedIT-U2S, a framework that automatically converts unstructured text data into structured instruction-response pairs for federated instruction tuning of large language models, broadening its practical applicability.

Contribution

FedIT-U2S is a novel framework that enables federated instruction tuning using unstructured text data by automatic data generation and retrieval-based example selection.

Findings

01

Consistent improvement over base LLM across three domains.

02

Effective automatic transformation of unstructured data into instruction-response pairs.

03

Enhanced flexibility in federated instruction tuning scenarios.

Abstract

Federated instruction tuning enables multiple clients to collaboratively fine-tune a shared large language model (LLM) that can follow humans' instructions without directly sharing raw data. However, existing literature impractically requires that all the clients readily hold instruction-tuning data (i.e., structured instruction-response pairs), which necessitates massive human annotations since clients' data is usually unstructured text instead. Addressing this, we propose a novel and flexible framework FedIT-U2S, which can automatically transform unstructured corpus into structured data for federated instruction tuning. FedIT-U2S consists two key steps: (1) few-shot instruction-tuning data generation, where each unstructured data piece together with several examples is combined to prompt an LLM in generating an instruction-response pair. To further enhance the flexibility, a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling

MethodsBalanced Selection