TL;DR
SAMoRA introduces a semantic-aware routing and task-adaptive scaling framework for multi-task learning with large language models, significantly improving expert specialization and task performance.
Contribution
It proposes a novel semantic-aware router and task-adaptive scaling mechanism for more precise expert routing and contribution regulation in multi-task learning.
Findings
Outperforms state-of-the-art methods on multiple benchmarks.
Demonstrates excellent task generalization capabilities.
Effectively promotes expert specialization and adaptive scaling.
Abstract
The combination of Mixture-of-Experts (MoE) and Low-Rank Adaptation (LoRA) has shown significant potential for enhancing the multi-task learning capabilities of Large Language Models. However, existing methods face two primary challenges: (1)Imprecise Routing in the current MoE-LoRA method fails to explicitly match input semantics with expert capabilities, leading to weak expert specialization. (2)Uniform weight fusion strategies struggle to provide adaptive update strengths, overlooking the varying complexity of different tasks. To address these limitations, we propose SAMoRA (Semantic-Aware Mixture of LoRA Experts), a novel parameter-efficient fine-tuning framework tailored for task-adaptive learning. Specifically, A Semantic-Aware Router is proposed to explicitly align textual semantics with the most suitable experts for precise routing. A Task-Adaptive Scaling mechanism is designed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
