Flexi-LoRA with Input-Adaptive Ranks: Efficient Finetuning for Speech and Reasoning Tasks

Zongqian Li; Yixuan Su; Han Zhou; Zihao Fu; Nigel Collier

arXiv:2605.01959·cs.LG·May 5, 2026

Flexi-LoRA with Input-Adaptive Ranks: Efficient Finetuning for Speech and Reasoning Tasks

Zongqian Li, Yixuan Su, Han Zhou, Zihao Fu, Nigel Collier

PDF

TL;DR

Flexi-LoRA introduces input-adaptive rank adjustment for efficient fine-tuning of large models, enhancing performance and reasoning quality across speech and reasoning tasks.

Contribution

It proposes a dynamic, input-dependent LoRA framework that outperforms static methods with fewer parameters, improving adaptability and reasoning in large language models.

Findings

01

Input-dependent parameter allocation improves performance.

02

Consistency between training and inference is crucial for effectiveness.

03

Task-specific rank dynamics vary, especially in reasoning tasks.

Abstract

Parameter-efficient fine-tuning methods like Low-Rank Adaptation (LoRA) have become essential for deploying large language models, yet their static parameter allocation remains suboptimal for inputs of varying complexity. We present Flexi-LoRA, a novel framework that dynamically adjusts LoRA ranks based on input complexity during both training and inference. Through empirical analysis across question answering, mathematical reasoning, and speech tasks, we demonstrate that maintaining consistency between training and inference dynamics is important for effective adaptation, particularly for sequential reasoning tasks. Our findings reveal that input-dependent parameter allocation achieves higher performance with fewer parameters by optimally matching rank configurations to question complexity. Furthermore, task-specific dependency on rank dynamics varies, with mathematical reasoning tasks…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.