SuperLoRA: Parameter-Efficient Unified Adaptation of Multi-Layer Attention Modules
Xiangyu Chen, Jing Liu, Ye Wang, Pu Perry Wang, Matthew Brand,, Guanghui Wang, Toshiaki Koike-Akino

TL;DR
SuperLoRA is a flexible, unified framework that extends low-rank adaptation techniques, enabling efficient fine-tuning of large models with superior performance, especially when parameter budgets are extremely limited.
Contribution
It introduces a generalized SuperLoRA framework that unifies and extends various LoRA variants through novel hyper-parameter settings and techniques.
Findings
Superior transfer learning performance in few-parameter regimes
High flexibility compared to existing LoRA variants
Effective in fine-tuning large language and diffusion models
Abstract
Low-rank adaptation (LoRA) and its variants are widely employed in fine-tuning large models, including large language models for natural language processing and diffusion models for computer vision. This paper proposes a generalized framework called SuperLoRA that unifies and extends different LoRA variants, which can be realized under different hyper-parameter settings. Introducing grouping, folding, shuffling, projecting, and tensor factoring, SuperLoRA offers high flexibility compared with other LoRA variants and demonstrates superior performance for transfer learning tasks especially in the extremely few-parameter regimes.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Brain Tumor Detection and Classification · Advanced Neural Network Applications
MethodsDiffusion
