Rank Also Matters: Hierarchical Configuration for Mixture of Adapter   Experts in LLM Fine-Tuning

Peizhuang Cong; Wenpu Liu; Wenhan Yu; Haochen Zhao; Tong Yang

arXiv:2502.03884·cs.LG·February 7, 2025

Rank Also Matters: Hierarchical Configuration for Mixture of Adapter Experts in LLM Fine-Tuning

Peizhuang Cong, Wenpu Liu, Wenhan Yu, Haochen Zhao, Tong Yang

PDF

Open Access

TL;DR

This paper introduces HILO, a hierarchical scheme for optimizing the number and rank of adapter experts across layers in LLM fine-tuning, leading to improved accuracy with fewer trainable parameters.

Contribution

HILO is the first method to dynamically adjust both the number and rank of adapter experts across layers in a hierarchical manner for LLM fine-tuning.

Findings

01

HILO outperforms existing methods in accuracy.

02

HILO uses fewer trainable parameters.

03

HILO effectively matches layer complexity.

Abstract

Large language models (LLMs) have demonstrated remarkable success across various tasks, accompanied by a continuous increase in their parameter size. Parameter-efficient fine-tuning (PEFT) methods, such as Low-Rank Adaptation (LoRA), address the challenges of fine-tuning LLMs by significantly reducing the number of trainable parameters. Recent studies have integrated LoRA with Mixture of Experts (MoE) architectures, leveraging multiple adapter experts and gating mechanisms to further improve fine-tuning performance. However, existing approaches primarily focus on adjusting the allocations of adapter experts per layer to optimize the introduced trainable parameter size, while neglecting a critical factor of adapters' rank. To this end, we propose a hierarchical scheme for expert allocation and rank configuration, HILO, which dynamically adjusts the number and rank of adapter experts…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Simulation Techniques and Applications · Semantic Web and Ontologies