OpenFedLLM: Training Large Language Models on Decentralized Private Data   via Federated Learning

Rui Ye; Wenhao Wang; Jingyi Chai; Dihan Li; Zexi Li; Yinda Xu; Yaxin; Du; Yanfeng Wang; Siheng Chen

arXiv:2402.06954·cs.LG·February 13, 2024·6 cites

OpenFedLLM: Training Large Language Models on Decentralized Private Data via Federated Learning

Rui Ye, Wenhao Wang, Jingyi Chai, Dihan Li, Zexi Li, Yinda Xu, Yaxin, Du, Yanfeng Wang, Siheng Chen

PDF

Open Access 3 Repos

TL;DR

This paper introduces OpenFedLLM, a federated learning framework for training large language models on private data, demonstrating improved performance and privacy preservation compared to local training, with models outperforming some existing benchmarks.

Contribution

It presents a comprehensive, research-friendly framework for federated LLM training, including instruction tuning, value alignment, and evaluation across diverse datasets and algorithms.

Findings

01

FL algorithms outperform local training in LLMs

02

Llama2-7B with FL surpasses GPT-4 on financial benchmarks

03

OpenFedLLM supports diverse domains and extensive evaluations

Abstract

Trained on massive publicly available data, large language models (LLMs) have demonstrated tremendous success across various fields. While more data contributes to better performance, a disconcerting reality is that high-quality public data will be exhausted in a few years. In this paper, we offer a potential next step for contemporary LLMs: collaborative and privacy-preserving LLM training on the underutilized distributed private data via federated learning (FL), where multiple data owners collaboratively train a shared model without transmitting raw data. To achieve this, we build a concise, integrated, and research-friendly framework/codebase, named OpenFedLLM. It covers federated instruction tuning for enhancing instruction-following capability, federated value alignment for aligning with human values, and 7 representative FL algorithms. Besides, OpenFedLLM supports training on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Internet Traffic Analysis and Secure E-voting

MethodsPosition-Wise Feed-Forward Layer · Dense Connections · Label Smoothing · Absolute Position Encodings · Softmax · Byte Pair Encoding · Linear Layer · Attention Is All You Need · Dropout · Multi-Head Attention