Beyond Parameter Aggregation: Semantic Consensus for Federated Fine-Tuning of LLMs
Amr Abourayya, Jens Kleesiek, Michael Kamp

TL;DR
This paper proposes a behavior-based federated fine-tuning method for large language models that reduces communication costs and accommodates heterogeneity, outperforming traditional parameter aggregation approaches.
Contribution
It introduces a novel semantic consensus approach that exchanges model outputs instead of parameters, enabling scalable and flexible federated LLM fine-tuning.
Findings
Achieves comparable performance to strong baselines.
Reduces communication by orders of magnitude.
Supports heterogeneous architectures and open-ended generation.
Abstract
Federated fine-tuning of large language models is commonly formulated as a parameter aggregation problem. However, even parameter-efficient methods require transmitting large collections of trainable weights, assume aligned architectures, and rely on white-box access to model parameters. As model sizes continue to grow and deployments become increasingly heterogeneous, these assumptions become progressively misaligned with practical constraints. We consider an alternative formulation in which collaboration is mediated through model behavior rather than parameters. Clients fine-tune local models on private data and exchange generated outputs on a shared, public prompt set. The server maps these outputs into a semantic representation space, forms a per-prompt semantic consensus, and returns pseudo-labels for further local fine-tuning. This formulation fundamentally changes the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
