Revisiting Federated Fine-Tuning: A Single Communication Round is Enough for Foundation Models
Ziyao Wang, Bowei Tian, Yexiao He, Zheyu Shen, Guoheng Sun, Yuhan Liu, Luyang Liu, Meng Liu, Ang Li

TL;DR
This paper demonstrates that a single communication round in federated fine-tuning of foundation models achieves comparable performance to multiple rounds, significantly reducing communication costs and enabling more practical, privacy-preserving federated learning.
Contribution
The paper provides both theoretical and empirical evidence that one-shot federated fine-tuning is sufficient for large foundation models, challenging the necessity of multi-round aggregation.
Findings
One-shot federated fine-tuning achieves similar performance to multi-round methods.
Large foundation models benefit from lower training loss in one-shot settings.
Communication costs are significantly reduced with single-round aggregation.
Abstract
The recent advancement of foundation models (FMs) has increased the demand for fine-tuning these models on large-scale cross-domain datasets. To address this, federated fine-tuning has emerged, allowing FMs to be fine-tuned on distributed datasets across multiple devices while ensuring data privacy. However, the substantial parameter size and the multi-round communication in federated learning algorithms result in prohibitively high communication costs, challenging the practicality of federated fine-tuning. In this paper, we identify and analyze, both theoretically and empirically, that the traditional multi-round aggregation algorithms may not be necessary for federated fine-tuning large FMs. Our experiments reveal that a single round of aggregation (i.e., one-shot federated fine-tuning) yields a global model performance comparable to that achieved through multiple rounds of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Modeling in Geospatial Applications · Model-Driven Software Engineering Techniques
