LoRA Is Slower Than You Think

Seokmin Ko

arXiv:2507.08833·cs.LG·July 15, 2025

LoRA Is Slower Than You Think

Seokmin Ko

PDF

Open Access

TL;DR

This paper critically examines LoRA's actual speed benefits in fine-tuning large language models, revealing inconsistencies and proposing more efficient alternatives that maintain performance while improving training speed.

Contribution

The paper provides a comprehensive analysis of LoRA's performance limitations and introduces new methods for more consistent and efficient LLM fine-tuning.

Findings

01

LoRA does not always improve training speed across different models.

02

Proposed methods achieve similar or better performance than LoRA.

03

New approaches offer more consistent training speed improvements.

Abstract

Low-Rank Adaptation (LoRA) is one of the most widely used techniques for fine-tuning large language models (LLMs). By introducing a small number of trainable low-rank weight matrices, LoRA substantially reduces the number of parameters that need to be updated, offering significant advantages in memory consumption and computational efficiency compared to full fine-tuning. However, we observed that LoRA does not consistently provide speed improvements across all model architectures and training setups. Motivated by this inconsistency, we conduct a comprehensive analysis of LoRA's performance and investigate the underlying factors limiting its speedup. Based on our findings, we propose several methods for more efficient fine-tuning of LLMs. We empirically evaluate these methods and compare them to LoRA, demonstrating that our approach achieves comparable or superior performance while…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications