DoRA: Weight-Decomposed Low-Rank Adaptation

Shih-Yang Liu; Chien-Yi Wang; Hongxu Yin; Pavlo Molchanov; Yu-Chiang; Frank Wang; Kwang-Ting Cheng; Min-Hung Chen

arXiv:2402.09353·cs.CL·July 10, 2024·37 cites

DoRA: Weight-Decomposed Low-Rank Adaptation

Shih-Yang Liu, Chien-Yi Wang, Hongxu Yin, Pavlo Molchanov, Yu-Chiang, Frank Wang, Kwang-Ting Cheng, Min-Hung Chen

PDF

Open Access 5 Repos 10 Models

TL;DR

This paper introduces DoRA, a weight-decomposed low-rank adaptation method that improves fine-tuning accuracy and stability of pre-trained models without increasing inference costs.

Contribution

The paper proposes DoRA, a novel weight decomposition approach that enhances LoRA's learning capacity and stability, narrowing the accuracy gap with full fine-tuning.

Findings

01

DoRA outperforms LoRA on multiple benchmarks

02

It improves fine-tuning stability and capacity

03

Effective across various models and tasks

Abstract

Among the widely used parameter-efficient fine-tuning (PEFT) methods, LoRA and its variants have gained considerable popularity because of avoiding additional inference costs. However, there still often exists an accuracy gap between these methods and full fine-tuning (FT). In this work, we first introduce a novel weight decomposition analysis to investigate the inherent differences between FT and LoRA. Aiming to resemble the learning capacity of FT from the findings, we propose Weight-Decomposed Low-Rank Adaptation (DoRA). DoRA decomposes the pre-trained weight into two components, magnitude and direction, for fine-tuning, specifically employing LoRA for directional updates to efficiently minimize the number of trainable parameters. By employing \ours, we enhance both the learning capacity and training stability of LoRA while avoiding any additional inference overhead.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Image Enhancement Techniques