Towards Specialized Generalists: A Multi-Task MoE-LoRA Framework for Domain-Specific LLM Adaptation
Yuxin Yang, Aoxiong Zeng, Xiangquan Yang

TL;DR
This paper introduces Med-MoE-LoRA, a novel multi-task framework combining Mixture-of-Experts and Low-Rank Adaptation to efficiently adapt large language models to medical domains, addressing stability and task interference issues.
Contribution
The paper proposes Med-MoE-LoRA, integrating asymmetric expert distribution and a knowledge-preservation plugin for improved domain-specific LLM adaptation.
Findings
Outperforms standard LoRA and MoE architectures on clinical NLP tasks.
Reduces task interference while maintaining general knowledge.
Achieves superior performance in medical benchmarks.
Abstract
The rapid evolution of Large Language Models (LLMs) has shifted focus from general-purpose capabilities to domain-specific expertise. However, adapting LLMs to specialized fields such as medicine presents two challenge: (1) the "Stability-Plasticity Dilemma", where the model must acquire complex clinical knowledge without suffering from catastrophic forgetting of general world knowledge; and (2) "Task Interference", where disparate sub-tasks, such as medical diagnosis, report summarization, and drug-drug interaction prediction, compete for limited low-rank parameter space. In this paper, we propose Med-MoE-LoRA, a novel framework that integrates Mixture-of-Experts (MoE) with Low-Rank Adaptation (LoRA) to enable efficient multi-task domain adaptation, especially for medical scenarios. Drawing inspiration from recent advances, our framework employs an asymmetric expert distribution where…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Artificial Intelligence in Healthcare and Education · Topic Modeling
