MatryoshkaLoRA: Learning Accurate Hierarchical Low-Rank Representations for LLM Fine-Tuning

Ionut-Vlad Modoranu; Mher Safaryan; Dan Alistarh

arXiv:2605.07850·cs.CL·May 11, 2026

MatryoshkaLoRA: Learning Accurate Hierarchical Low-Rank Representations for LLM Fine-Tuning

Ionut-Vlad Modoranu, Mher Safaryan, Dan Alistarh

PDF

1 Repo

TL;DR

MatryoshkaLoRA introduces a hierarchical low-rank adaptation framework for efficient fine-tuning of large language models, improving accuracy and performance trade-offs across ranks.

Contribution

It proposes a simple yet effective hierarchical low-rank training method that outperforms prior rank-adaptive approaches in accuracy and efficiency.

Findings

01

MatryoshkaLoRA learns more accurate hierarchical low-rank representations.

02

It achieves superior accuracy-performance trade-offs across ranks.

03

Supports dynamic rank selection with minimal accuracy loss.

Abstract

With the rise in scale for deep learning models to billions of parameters, the computational cost of fine-tuning remains a significant barrier to deployment. While Low-Rank Adaptation (LoRA) has become the standard for parameter-efficient fine-tuning, the need to set a predefined, static rank $r$ requires exhaustive grid searches to balance efficiency and performance. Existing rank-adaptive solutions such as DyLoRA mitigate this by sampling ranks during the training from a predefined distribution. However, they often yield sub-optimal results at higher ranks due to lack of consistent gradient signals across the full hierarchy of ranks, thus making these methods data-inefficient. In this paper, we propose MatryoshkaLoRA, a general, Matryoshka-inspired training framework for LoRA that learns accurate hierarchical low-rank representations by inserting a fixed, carefully crafted diagonal…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

IST-DASLab/MatryoshkaLoRA
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.