SuperLoRA: Parameter-Efficient Unified Adaptation of Multi-Layer   Attention Modules

Xiangyu Chen; Jing Liu; Ye Wang; Pu Perry Wang; Matthew Brand,; Guanghui Wang; Toshiaki Koike-Akino

arXiv:2403.11887·cs.CV·March 19, 2024·1 cites

SuperLoRA: Parameter-Efficient Unified Adaptation of Multi-Layer Attention Modules

Xiangyu Chen, Jing Liu, Ye Wang, Pu Perry Wang, Matthew Brand,, Guanghui Wang, Toshiaki Koike-Akino

PDF

Open Access

TL;DR

SuperLoRA is a flexible, unified framework that extends low-rank adaptation techniques, enabling efficient fine-tuning of large models with superior performance, especially when parameter budgets are extremely limited.

Contribution

It introduces a generalized SuperLoRA framework that unifies and extends various LoRA variants through novel hyper-parameter settings and techniques.

Findings

01

Superior transfer learning performance in few-parameter regimes

02

High flexibility compared to existing LoRA variants

03

Effective in fine-tuning large language and diffusion models

Abstract

Low-rank adaptation (LoRA) and its variants are widely employed in fine-tuning large models, including large language models for natural language processing and diffusion models for computer vision. This paper proposes a generalized framework called SuperLoRA that unifies and extends different LoRA variants, which can be realized under different hyper-parameter settings. Introducing grouping, folding, shuffling, projecting, and tensor factoring, SuperLoRA offers high flexibility compared with other LoRA variants and demonstrates superior performance for transfer learning tasks especially in the extremely few-parameter regimes.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Brain Tumor Detection and Classification · Advanced Neural Network Applications

MethodsDiffusion