Beyond LoRA vs. Full Fine-Tuning: Gradient-Guided Optimizer Routing for LLM Adaptation

Haozhan Tang; Xiuqi Zhu; Xinyin Zhang; Boxun Li; Virginia Smith; Kevin Kuo

arXiv:2605.07111·cs.CL·May 19, 2026

Beyond LoRA vs. Full Fine-Tuning: Gradient-Guided Optimizer Routing for LLM Adaptation

Haozhan Tang, Xiuqi Zhu, Xinyin Zhang, Boxun Li, Virginia Smith, Kevin Kuo

PDF

1 Repo

TL;DR

This paper introduces MoLF, a dynamic framework that adaptively combines full fine-tuning and LoRA for large language model adaptation, improving stability and performance across diverse tasks.

Contribution

The paper proposes MoLF, a novel optimizer routing method that seamlessly integrates FFT and LoRA, enabling flexible, stable, and efficient LLM fine-tuning.

Findings

01

MoLF improves or matches FFT and LoRA performance within 1.5% across tasks.

02

MoLF-Efficient outperforms prior adaptive LoRA by up to 20% on Fact datasets.

03

Empirical evaluations across multiple models and tasks validate the effectiveness of MoLF.

Abstract

Recent literature on fine-tuning Large Language Models highlights a fundamental debate. While Full Fine-Tuning (FFT) provides the representational plasticity required for high-entropy knowledge injection, Low-Rank Adaptation (LoRA) can match or surpass FFT performance because many tasks only require updates in a low-rank space and benefit from LoRA's additional regularization. Through empirical evaluation across diverse tasks (SQL, Medical QA, and Counterfactual Knowledge) and varying language models (Gemma-3-1B, Qwen2.5-1.5B, and Qwen2.5-3B), we verify both trends and demonstrate that relying solely on either static architecture is structurally limited. To address this challenge, we propose a Mixture of LoRA and Full (MoLF) Fine-Tuning, a unified framework that enables continuous navigation between both training regimes. MoLF dynamically routes updates between FFT and LoRA at the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

11785T23/molf.git
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.