Optimizing Multilingual LLMs via Federated Learning: A Study of Client Language Composition
Aleix Sant, Jordi Luque, Carlos Escolano

TL;DR
This paper investigates how client language diversity impacts federated learning of multilingual large language models, proposing a new early stopping method and analyzing effects on model quality, fairness, and training efficiency.
Contribution
We extended the FederatedScope-LLM framework for multilingual instruction-tuning and introduced Local Dynamic Early Stopping (LDES-FL) to improve training efficiency and sustainability.
Findings
Monolingual fine-tuning excels for single-language tasks.
Federated training benefits from increased multilinguality for fairness and lower-resource languages.
Client language composition significantly influences model performance, fairness, and training cost.
Abstract
Federated Learning (FL) of Large Language Models (LLMs) in multilingual environments presents significant challenges stemming from heterogeneous language distributions across clients and disparities in language resource availability. To address these challenges, we extended the FederatedScope-LLM framework to support multilingual instruction-tuning experiments with LLMs. We also introduced a novel client-specific early stopping mechanism, Local Dynamic Early Stopping (LDES-FL), which allows clients to pause and resume local training based on client-side validation performance, enhancing training efficiency and sustainability. Through a series of experiments, we studied how client language composition - from fully monolingual to increasingly multilingual clients - affects multilingual quality, fairness and training cost. Monolingual local fine-tuning remains the most effective for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBig Data and Digital Economy · Privacy-Preserving Technologies in Data · Topic Modeling
