ABM-LoRA: Activation Boundary Matching for Fast Convergence in Low-Rank Adaptation
Dongha Lee, Jinhee Park, Minjun Kim, Junseok Kwon

TL;DR
ABM-LoRA introduces an activation boundary alignment technique that significantly speeds up the convergence of low-rank adapters by reducing information loss at initialization, proven effective across multiple architectures and tasks.
Contribution
It presents a novel initialization strategy for low-rank adapters that aligns activation boundaries to improve convergence speed and performance.
Findings
Accelerates convergence across diverse models and tasks.
Achieves state-of-the-art accuracy on VTAB-1K.
Enhances performance on structured reasoning tasks.
Abstract
We propose Activation Boundary Matching for Low-Rank Adaptation (ABM-LoRA), a principled initialization strategy that substantially accelerates the convergence of low-rank adapters. While LoRA offers high parameter efficiency, its random initialization restricts gradient updates to a mismatched tangent space, causing significant information loss and hindering early convergence. Our ABM-LoRA addresses this by aligning the adapter's activation boundaries with those of the pretrained model before downstream training, thereby maximizing the projection of full-parameter gradients into the adapter subspace. This alignment sharply reduces information loss at initialization, yields a lower starting loss, and accelerates convergence. We demonstrate ABM-LoRA's effectiveness across diverse architectures and tasks: language understanding (T5-Base on GLUE), dialogue generation (LLaMA2-7B on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Topic Modeling
