DoRA: Weight-Decomposed Low-Rank Adaptation
Shih-Yang Liu, Chien-Yi Wang, Hongxu Yin, Pavlo Molchanov, Yu-Chiang, Frank Wang, Kwang-Ting Cheng, Min-Hung Chen

TL;DR
This paper introduces DoRA, a weight-decomposed low-rank adaptation method that improves fine-tuning accuracy and stability of pre-trained models without increasing inference costs.
Contribution
The paper proposes DoRA, a novel weight decomposition approach that enhances LoRA's learning capacity and stability, narrowing the accuracy gap with full fine-tuning.
Findings
DoRA outperforms LoRA on multiple benchmarks
It improves fine-tuning stability and capacity
Effective across various models and tasks
Abstract
Among the widely used parameter-efficient fine-tuning (PEFT) methods, LoRA and its variants have gained considerable popularity because of avoiding additional inference costs. However, there still often exists an accuracy gap between these methods and full fine-tuning (FT). In this work, we first introduce a novel weight decomposition analysis to investigate the inherent differences between FT and LoRA. Aiming to resemble the learning capacity of FT from the findings, we propose Weight-Decomposed Low-Rank Adaptation (DoRA). DoRA decomposes the pre-trained weight into two components, magnitude and direction, for fine-tuning, specifically employing LoRA for directional updates to efficiently minimize the number of trainable parameters. By employing \ours, we enhance both the learning capacity and training stability of LoRA while avoiding any additional inference overhead.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗serpdotai/sparsetral-16x7B-v2-SPIN_iter1model· 9 dl· ♡ 139 dl♡ 13
- 🤗trollek/NinjaMouse-2.4B-32L-danubemodel· 111 dl· ♡ 8111 dl♡ 8
- 🤗cgus/NinjaMouse-2.4B-32L-danube-exl2model· 3 dl3 dl
- 🤗zhichen/Qwen-WisdomVastmodel· 3 dl· ♡ 13 dl♡ 1
- 🤗zhichen/Qwen-WisdomVast-Loramodel· ♡ 5♡ 5
- 🤗zhichen/Llama3-Chinesemodel· 11 dl· ♡ 1911 dl♡ 19
- 🤗zhichen/Llama3-Chinese-Loramodel· ♡ 4♡ 4
- 🤗LMFResearchSociety/SDXLDoRAArchivemodel· ♡ 5♡ 5
- 🤗anamikac2708/Mistral-7B-DORA-finetuned-investopedia-Lora-Adaptersmodel
- 🤗sunatte/txt2sqlmodel
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Image Enhancement Techniques
