FedRot-LoRA: Mitigating Rotational Misalignment in Federated LoRA

Haoran Zhang; Dongjun Kim; Seohyeon Cha; Haris Vikalo

arXiv:2602.23638·cs.LG·May 13, 2026

FedRot-LoRA: Mitigating Rotational Misalignment in Federated LoRA

Haoran Zhang, Dongjun Kim, Seohyeon Cha, Haris Vikalo

PDF

TL;DR

FedRot-LoRA introduces a method to align client updates via orthogonal transformations in federated LoRA, reducing aggregation errors caused by rotational misalignment and improving training stability and performance.

Contribution

It proposes a novel rotational alignment technique for federated LoRA that enhances update aggregation without extra communication costs.

Findings

01

FedRot-LoRA outperforms existing federated LoRA methods across various tasks.

02

Rotational alignment reduces aggregation error and stabilizes training.

03

Theoretical analysis shows tighter error bounds with alignment.

Abstract

Federated LoRA provides a communication-efficient mechanism for fine-tuning large language models on decentralized data. In practice, however, a discrepancy between the factor-wise averaging used to preserve low rank and the mathematically correct aggregation of local updates can cause significant aggregation error and unstable training. We argue that a major source of this problem is rotational misalignment, arising from the rotational invariance of low-rank factorizations -- semantically equivalent updates can be represented in different latent subspaces across clients since $(B_{i} R_{i}) (R_{i}^{⊤} A_{i}) = B_{i} A_{i}$ . When such misaligned factors are averaged directly, they interfere destructively and degrade the global update. To address this issue, we propose FedRot-LoRA, a federated LoRA framework that aligns client updates via orthogonal transformations prior to aggregation. This…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.