CE-LoRA: Computation-Efficient LoRA Fine-Tuning for Language Models

Guanduo Chen; Yutong He; Yipeng Hu; Kun Yuan; Binhang Yuan

arXiv:2502.01378·cs.LG·February 4, 2025·2 cites

CE-LoRA: Computation-Efficient LoRA Fine-Tuning for Language Models

Guanduo Chen, Yutong He, Yipeng Hu, Kun Yuan, Binhang Yuan

PDF

Open Access

TL;DR

CE-LoRA is a novel fine-tuning method for large language models that reduces computational costs by approximating matrix operations and improving gradient accuracy, maintaining performance while enhancing efficiency.

Contribution

The paper introduces CE-LoRA, a computation-efficient fine-tuning algorithm that significantly lowers computational costs of LoRA without sacrificing accuracy.

Findings

01

CE-LoRA reduces training time compared to LoRA.

02

CE-LoRA maintains comparable model performance.

03

Theoretical convergence rate matches that of LoRA.

Abstract

Large Language Models (LLMs) demonstrate exceptional performance across various tasks but demand substantial computational resources even for fine-tuning computation. Although Low-Rank Adaptation (LoRA) significantly alleviates memory consumption during fine-tuning, its impact on computational cost reduction is limited. This paper identifies the computation of activation gradients as the primary bottleneck in LoRA's backward propagation and introduces the Computation-Efficient LoRA (CE-LoRA) algorithm, which enhances computational efficiency while preserving memory efficiency. CE-LoRA leverages two key techniques: Approximated Matrix Multiplication, which replaces dense multiplications of large and complete matrices with sparse multiplications involving only critical rows and columns, and the Double-LoRA technique, which reduces error propagation in activation gradients. Theoretically,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Natural Language Processing Techniques · Topic Modeling