Randomized Asymmetric Chain of LoRA: The First Meaningful Theoretical Framework for Low-Rank Adaptation
Grigory Malinovsky, Umberto Michieli, Hasan Abed Al Kader Hammoud,, Taha Ceritli, Hayder Elesedy, Mete Ozay, Peter Richt\'arik

TL;DR
This paper introduces RAC-LoRA, a theoretical framework that guarantees convergence of low-rank adaptation methods like LoRA, bridging the gap between empirical heuristics and provable optimization guarantees in fine-tuning large models.
Contribution
It provides the first rigorous convergence analysis of LoRA and its variants, proposing RAC-LoRA as a provably convergent optimization framework with theoretical guarantees.
Findings
RAC-LoRA achieves convergence rates comparable to full-parameter fine-tuning.
The framework applies to non-convex loss functions in various learning settings.
Experimental results support the theoretical convergence guarantees.
Abstract
Fine-tuning has become a popular approach to adapting large foundational models to specific tasks. As the size of models and datasets grows, parameter-efficient fine-tuning techniques are increasingly important. One of the most widely used methods is Low-Rank Adaptation (LoRA), with adaptation update expressed as the product of two low-rank matrices. While LoRA was shown to possess strong performance in fine-tuning, it often under-performs when compared to full-parameter fine-tuning (FPFT). Although many variants of LoRA have been extensively studied empirically, their theoretical optimization analysis is heavily under-explored. The starting point of our work is a demonstration that LoRA and its two extensions, Asymmetric LoRA and Chain of LoRA, indeed encounter convergence issues. To address these issues, we propose Randomized Asymmetric Chain of LoRA (RAC-LoRA) -- a general…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Reservoir Computing
