FedRot-LoRA: Mitigating Rotational Misalignment in Federated LoRA
Haoran Zhang, Dongjun Kim, Seohyeon Cha, Haris Vikalo

TL;DR
FedRot-LoRA introduces a method to align client updates via orthogonal transformations in federated LoRA, reducing aggregation errors caused by rotational misalignment and improving training stability and performance.
Contribution
It proposes a novel rotational alignment technique for federated LoRA that enhances update aggregation without extra communication costs.
Findings
FedRot-LoRA outperforms existing federated LoRA methods across various tasks.
Rotational alignment reduces aggregation error and stabilizes training.
Theoretical analysis shows tighter error bounds with alignment.
Abstract
Federated LoRA provides a communication-efficient mechanism for fine-tuning large language models on decentralized data. In practice, however, a discrepancy between the factor-wise averaging used to preserve low rank and the mathematically correct aggregation of local updates can cause significant aggregation error and unstable training. We argue that a major source of this problem is rotational misalignment, arising from the rotational invariance of low-rank factorizations -- semantically equivalent updates can be represented in different latent subspaces across clients since . When such misaligned factors are averaged directly, they interfere destructively and degrade the global update. To address this issue, we propose FedRot-LoRA, a federated LoRA framework that aligns client updates via orthogonal transformations prior to aggregation. This…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
