TL;DR
MatryoshkaLoRA introduces a hierarchical low-rank adaptation framework for efficient fine-tuning of large language models, improving accuracy and performance trade-offs across ranks.
Contribution
It proposes a simple yet effective hierarchical low-rank training method that outperforms prior rank-adaptive approaches in accuracy and efficiency.
Findings
MatryoshkaLoRA learns more accurate hierarchical low-rank representations.
It achieves superior accuracy-performance trade-offs across ranks.
Supports dynamic rank selection with minimal accuracy loss.
Abstract
With the rise in scale for deep learning models to billions of parameters, the computational cost of fine-tuning remains a significant barrier to deployment. While Low-Rank Adaptation (LoRA) has become the standard for parameter-efficient fine-tuning, the need to set a predefined, static rank requires exhaustive grid searches to balance efficiency and performance. Existing rank-adaptive solutions such as DyLoRA mitigate this by sampling ranks during the training from a predefined distribution. However, they often yield sub-optimal results at higher ranks due to lack of consistent gradient signals across the full hierarchy of ranks, thus making these methods data-inefficient. In this paper, we propose MatryoshkaLoRA, a general, Matryoshka-inspired training framework for LoRA that learns accurate hierarchical low-rank representations by inserting a fixed, carefully crafted diagonal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
