Calibrating and Rotating: A Unified Framework for Weight Conditioning in PEFT
Da Chang, Peng Xue, Yu Li, Yongxiang Liu, Pengxiang Xu, Shixun Zhang

TL;DR
This paper introduces a unified framework for weight conditioning in PEFT, revealing DoRA's effectiveness as a learnable conditioning method and proposing two novel, efficient techniques that improve performance on NLP tasks.
Contribution
It reformulates DoRA as a learnable weight conditioning method and develops a unified framework for designing advanced PEFT techniques, including Pre-Diag and SORA.
Findings
Pre-Diag improves performance and reduces training time.
SORA achieves superior performance with norm-preserving transformations.
Proposed methods outperform LoRA and DoRA in NLP tasks.
Abstract
Parameter-Efficient Fine-Tuning (PEFT) methods are crucial for adapting large pre-trained models. Among these, LoRA is considered a foundational approach. Building on this, the influential DoRA method enhances performance by decomposing weight updates into magnitude and direction. However, its underlying mechanism remains unclear, and it introduces significant computational overhead. In this work, we first identify that DoRA's success stems from its capacity to increase the singular value entropy of the weight update matrix, which promotes a more uniform update distribution akin to full fine-tuning. We then reformulate DoRA into a mathematically equivalent and more efficient matrix form, revealing it as a learnable weight conditioning method. Based on this insight, we propose a unified framework for designing advanced PEFT methods by exploring two orthogonal dimensions: the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Multimodal Machine Learning Applications · Advanced Neural Network Applications
