Gradient dynamics for low-rank fine-tuning beyond kernels
Arif Kerem Dayi, Sitan Chen

TL;DR
This paper analyzes the convergence of low-rank fine-tuning methods like LoRA in neural networks, providing theoretical guarantees for gradient descent convergence in a student-teacher model with Gaussian inputs.
Contribution
It offers the first theoretical analysis of low-rank fine-tuning dynamics, showing convergence conditions and iteration complexity beyond simple linear models.
Findings
Gradient descent converges to the teacher model in polynomial time.
Convergence does not depend on the activation's Hermite expansion properties.
Learning from scratch requires significantly more iterations.
Abstract
LoRA has emerged as one of the de facto methods for fine-tuning foundation models with low computational cost and memory footprint. The idea is to only train a low-rank perturbation to the weights of a pre-trained model, given supervised data for a downstream task. Despite its empirical sucess, from a mathematical perspective it remains poorly understood what learning mechanisms ensure that gradient descent converges to useful low-rank perturbations. In this work we study low-rank fine-tuning in a student-teacher setting. We are given the weights of a two-layer base model , as well as i.i.d. samples where is Gaussian and is the teacher model given by perturbing the weights of by a rank-1 matrix. This generalizes the setting of generalized linear model (GLM) regression where the weights of are zero. When the rank-1 perturbation is comparable in norm…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Numerical methods in inverse problems · Characterization and Applications of Magnetic Nanoparticles
MethodsGLM · Balanced Selection
