BoostLoRA: Growing Effective Rank by Boosting Adapters
Raviteja Anantha, Nick Levato, Layne C. Price

TL;DR
BoostLoRA is a gradient-boosting PEFT framework that iteratively trains and merges ultra-low-rank adapters, enabling effective rank growth and improved performance without inference overhead.
Contribution
It introduces a novel method where effective rank grows with training, surpassing existing PEFT methods and full fine-tuning in various tasks.
Findings
Achieves 89.1% on GSM8K, surpassing single-shot adapters and full fine-tuning.
Reaches 57.2% on MBPP, outperforming full fine-tuning.
First PEFT method with effective rank growth during training.
Abstract
Parameter-efficient fine-tuning (PEFT) methods face a tradeoff between adapter size and expressivity: ultra-low-parameter adapters are confined to fixed low-rank subspaces, capping performance even with extended training. We propose BoostLoRA, a gradient-boosting framework that overcomes this limit by iteratively training and merging minimal adapters on the examples the current model gets wrong. A ROTATE SVD basis strategy assigns each round to an orthogonal subspace, so cumulative effective rank grows linearly with the number of rounds while each adapter remains ultra-low-rank. After merging, adapters are discarded, leaving zero inference overhead. On Qwen2.5-3B, BoostLoRA reaches 89.1% on GSM8K and 68.8% on MATH-500, surpassing both the best single-shot ultra-low parameter adapter (TinyLoRA) and full fine-tuning; on code generation it reaches 57.2% on MBPP and 80.4% on HumanEval while…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
