HD-PiSSA: High-Rank Distributed Orthogonal Adaptation
Yiding Wang, Fauxu Meng, Xuefeng Zhang, Fan Jiang, Pingzhi Tang, Muhan Zhang

TL;DR
HD-PiSSA is a novel distributed PEFT method for large language models that significantly increases update expressiveness by assigning different principal components to each device, leading to substantial performance improvements.
Contribution
It introduces a high-rank distributed PEFT approach that assigns orthogonal adapters across devices, expanding update directions and improving fine-tuning performance on complex tasks.
Findings
Over 16x higher effective update ranks with 8 GPUs.
Achieves 10.0 absolute point gains over LoRA in multi-task learning.
Demonstrates significant performance improvements across diverse benchmarks.
Abstract
Existing parameter-efficient fine-tuning (PEFT) methods for large language models (LLMs), such as LoRA and PiSSA, constrain model updates to low-rank subspaces, limiting their expressiveness and leading to suboptimal performance on complex tasks. To address this, we introduce High-rank Distributed PiSSA (HD-PiSSA), a distributed PEFT approach that initializes orthogonal adapters across different devices and aggregates their delta updates collectively on W for fine-tuning. Unlike Data Parallel LoRA or PiSSA, which maintain identical adapters across all devices, HD-PiSSA assigns different principal components of the pre-trained weights to each GPU, significantly expanding the range of update directions. This results in over 16x higher effective updated ranks than data-parallel LoRA or PiSSA when fine-tuning on 8 GPUs with the same per-device adapter rank. Empirically, we evaluate HD-PiSSA…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Vision and Imaging · CCD and CMOS Imaging Sensors · Image Enhancement Techniques
MethodsAdapter
