LoRA-DA: Data-Aware Initialization for Low-Rank Adaptation via Asymptotic Analysis
Qingyue Zhang, Chang Chu, Tianren Peng, Qi Li, Xiangyang Luo, Zhihao Jiang, Shao-Lun Huang

TL;DR
This paper introduces LoRA-DA, a data-aware initialization method for Low-Rank Adaptation that leverages target-domain data and asymptotic analysis to improve fine-tuning performance.
Contribution
It develops a theoretical framework and an efficient algorithm for data-aware LoRA initialization, enhancing accuracy, convergence speed, and robustness.
Findings
LoRA-DA outperforms existing initialization methods across multiple benchmarks.
LoRA-DA achieves faster and more stable convergence.
It requires only a small overhead for initialization.
Abstract
LoRA has become a widely adopted method for PEFT, and its initialization methods have attracted increasing attention. However, existing methods have notable limitations: many methods do not incorporate target-domain data, while gradient-based methods exploit data only at a shallow level by relying on one-step gradient decomposition. In this paper, we establish a theoretical framework for data-aware LoRA initialization. Starting from minimizing the expectation of the parameter discrepancy between the fine-tuned and target models, we derive an optimization problem with two components: a bias term, which is related to the parameter distance between the fine-tuned and target models, and is approximated using a Fisher-gradient formulation to preserve anisotropy; and a variance term, which accounts for the uncertainty introduced by sampling stochasticity through the Fisher information.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
