TL;DR
This paper introduces MoLF, a dynamic framework that adaptively combines full fine-tuning and LoRA for large language model adaptation, improving stability and performance across diverse tasks.
Contribution
The paper proposes MoLF, a novel optimizer routing method that seamlessly integrates FFT and LoRA, enabling flexible, stable, and efficient LLM fine-tuning.
Findings
MoLF improves or matches FFT and LoRA performance within 1.5% across tasks.
MoLF-Efficient outperforms prior adaptive LoRA by up to 20% on Fact datasets.
Empirical evaluations across multiple models and tasks validate the effectiveness of MoLF.
Abstract
Recent literature on fine-tuning Large Language Models highlights a fundamental debate. While Full Fine-Tuning (FFT) provides the representational plasticity required for high-entropy knowledge injection, Low-Rank Adaptation (LoRA) can match or surpass FFT performance because many tasks only require updates in a low-rank space and benefit from LoRA's additional regularization. Through empirical evaluation across diverse tasks (SQL, Medical QA, and Counterfactual Knowledge) and varying language models (Gemma-3-1B, Qwen2.5-1.5B, and Qwen2.5-3B), we verify both trends and demonstrate that relying solely on either static architecture is structurally limited. To address this challenge, we propose a Mixture of LoRA and Full (MoLF) Fine-Tuning, a unified framework that enables continuous navigation between both training regimes. MoLF dynamically routes updates between FFT and LoRA at the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
