Sharp Generalization Bounds for Foundation Models with Asymmetric Randomized Low-Rank Adapters
Anastasis Kratsios, Tin Sum Cheng, Aurelien Lucchi, Haitz S\'aez de Oc\'ariz Borde

TL;DR
This paper provides a theoretical analysis of asymmetric LoRA, a parameter-efficient fine-tuning method for foundation models, focusing on generalization bounds and the behavior of single training runs with random factors.
Contribution
It offers the first concentration results for the generalization gap of a single asymmetric LoRA run and establishes matching upper and lower bounds on sample complexity.
Findings
Sample complexity is $ ilde{O}(rac{ ext{sqrt}(r)}{ ext{sqrt}(N)})$ with high probability.
Fundamental limits show a lower bound of $rac{1}{ ext{sqrt}(N)}$ on sample efficiency.
Insights into the reliability of single-run fine-tuning with asymmetric LoRA.
Abstract
Low-Rank Adaptation (LoRA) has emerged as a widely adopted parameter-efficient fine-tuning (PEFT) technique for foundation models. Recent work has highlighted an inherent asymmetry in the initialization of LoRA's low-rank factors, which has been present since its inception and was presumably derived experimentally. This paper focuses on providing a comprehensive theoretical characterization of asymmetric LoRA with frozen random factors. First, while existing research provides upper-bound generalization guarantees based on averages over multiple experiments, the behaviour of a single fine-tuning run with specific random factors remains an open question. We address this by investigating the concentration of the typical LoRA generalization gap around its mean. Our main upper bound reveals a sample complexity of with high…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Statistical Methods and Inference · Hydrological Forecasting Using AI
