Hydra: Multi-head Low-rank Adaptation for Parameter Efficient Fine-tuning
Sanghyeon Kim, Hyunmo Yang, Younghyun Kim, Youngjoon Hong, and Eunbyung Park

TL;DR
Hydra introduces a multi-head low-rank adaptation method that combines parallel and sequential branches to enhance parameter-efficient fine-tuning of large models, achieving superior performance across tasks.
Contribution
The paper proposes Hydra, a novel multi-head adapter that integrates parallel and sequential adaptation branches for more expressive and generalizable fine-tuning.
Findings
Hydra outperforms existing methods in efficiency and accuracy.
Empirical analysis confirms the complementary benefits of combined branches.
Extensive experiments validate Hydra's superior adaptability across tasks.
Abstract
The recent surge in large-scale foundation models has spurred the development of efficient methods for adapting these models to various downstream tasks. Low-rank adaptation methods, such as LoRA, have gained significant attention due to their outstanding parameter efficiency and lack of additional inference latency. This paper investigates a more general form of adapter module based on the analysis that parallel and sequential adaptation branches learn novel and general features during fine-tuning, respectively. The proposed method, named Hydra for its multi-head computational branches, combines parallel and sequential branches to integrate their capabilities, which is more expressive than existing single-branch methods and enables the exploration of a broader range of optimal points in the fine-tuning process. In addition, the proposed adaptation method explicitly leverages the pre-trained…
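To make the idea of combining a parallel and a sequential low-rank branch concrete, here is a minimal sketch in plain Python. It is not the authors' implementation. It assumes a linear layer of the form y = W0 x + B_par A_par x + B_seq A_seq (W0 x), where the parallel branch reads the layer input and the sequential branch reads the frozen layer's output. All names (hydra_forward, merge_weights, a_par, b_seq, and so on) are hypothetical, and the adapter matrices are randomly initialized only so that the check is non-trivial. Under this assumed form, both branches are linear and can be folded into a single weight matrix, which is what allows LoRA-style adapters to avoid extra inference latency.

```python
import random

def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def matadd(a, b):
    """Element-wise sum of two matrices with the same shape."""
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def rand_matrix(rows, cols, rng, scale=0.1):
    """Small random matrix; a stand-in for trained weights (assumption)."""
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]

def hydra_forward(x, w0, a_par, b_par, a_seq, b_seq):
    """Assumed multi-branch forward pass for a column vector x (d_in x 1)."""
    frozen = matmul(w0, x)                             # W0 x, pre-trained path
    parallel = matmul(b_par, matmul(a_par, x))         # B_par A_par x
    sequential = matmul(b_seq, matmul(a_seq, frozen))  # B_seq A_seq (W0 x)
    return matadd(matadd(frozen, parallel), sequential)

def merge_weights(w0, a_par, b_par, a_seq, b_seq):
    """Fold both branches into one matrix: W0 + B_par A_par + B_seq A_seq W0."""
    delta_par = matmul(b_par, a_par)                   # d_out x d_in
    delta_seq = matmul(matmul(b_seq, a_seq), w0)       # d_out x d_in
    return matadd(matadd(w0, delta_par), delta_seq)

rng = random.Random(0)
d_in, d_out, rank = 6, 4, 2          # illustrative sizes, not from the paper

w0 = rand_matrix(d_out, d_in, rng)   # frozen pre-trained weight
a_par = rand_matrix(rank, d_in, rng)     # parallel branch: down-projection of x
b_par = rand_matrix(d_out, rank, rng)    # parallel branch: up-projection
a_seq = rand_matrix(rank, d_out, rng)    # sequential branch: down-projection of W0 x
b_seq = rand_matrix(d_out, rank, rng)    # sequential branch: up-projection
x = rand_matrix(d_in, 1, rng)

y_branches = hydra_forward(x, w0, a_par, b_par, a_seq, b_seq)
w_merged = merge_weights(w0, a_par, b_par, a_seq, b_seq)
y_merged = matmul(w_merged, x)

# Largest absolute difference between the two ways of computing the output.
max_gap = max(abs(p[0] - q[0]) for p, q in zip(y_branches, y_merged))
print("max difference between branch and merged outputs:", max_gap)
```

In this sketch the sequential branch never sees x directly; it only transforms W0 x, which is one way to read the abstract's statement that the method explicitly leverages the pre-trained weights. The merged matrix has the same shape as W0, so a deployed layer would need only one matrix multiplication.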
Taxonomy
Topics: Advanced Neural Network Applications · Lattice Boltzmann Simulation Studies · Speech and Audio Processing
Methods: Hydra · Adapter
