TuckA: Hierarchical Compact Tensor Experts for Efficient Fine-Tuning
Qifeng Lei, Zhiyong Yang, Qianqian Xu, Cong Hua, Peisong Wen, Qingming Huang

TL;DR
TuckA introduces a hierarchical, tensor-based approach for parameter-efficient fine-tuning that captures diverse data features with fewer parameters, improving performance across multiple tasks.
Contribution
The paper proposes Tucker Adaptation (TuckA), a novel hierarchical tensor expert method that enhances PEFT by using Tucker decomposition and efficient routing for diverse task adaptation.
Findings
Outperforms existing PEFT methods on NLP, vision, and reasoning benchmarks.
Reduces parameter count while maintaining or improving accuracy.
Demonstrates effective data-aware initialization and expert routing strategies.
Abstract
Efficiently fine-tuning pre-trained models for downstream tasks is a key challenge in the era of foundation models. Parameter-efficient fine-tuning (PEFT) presents a promising solution, achieving performance comparable to full fine-tuning by updating only a small number of adaptation weights per layer. Traditional PEFT methods typically rely on a single expert, where the adaptation weight is a low-rank matrix. However, for complex tasks, the data's inherent diversity poses a significant challenge for such models, as a single adaptation weight cannot adequately capture the features of all samples. To address this limitation, we explore how to integrate multiple small adaptation experts into a compact structure to defeat a large adapter. Specifically, we propose Tucker Adaptation (TuckA), a method with four key properties: (i) We use Tucker decomposition to create a compact 3D tensor…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Neural Network Applications · Tensor decomposition and applications
