MING-MOE: Enhancing Medical Multi-Task Learning in Large Language Models with Sparse Mixture of Low-Rank Adapter Experts

Yusheng Liao; Shuyang Jiang; Yu Wang; Yanfeng Wang

arXiv:2404.09027·cs.CL·April 16, 2024·1 cites

TL;DR

MING-MOE is a novel medical large language model that leverages a sparse mixture of low-rank adapters to handle diverse medical tasks efficiently without task-specific annotations, achieving state-of-the-art results.

Contribution

The paper introduces MING-MOE, a Mixture-of-Experts (MOE) model that uses Mixture of Low-Rank Adaptation (MoLoRA) for efficient multi-task learning in medical NLP without requiring task-specific annotations.

Findings

01 Achieves state-of-the-art (SOTA) results on over 20 medical tasks

02 Improves inference efficiency

03 Handles diverse medical tasks without task-specific annotations

Abstract

Large language models like ChatGPT have shown substantial progress in natural language understanding and generation, proving valuable across various disciplines, including the medical field. Despite these advancements, challenges persist due to the complexity and diversity inherent in medical tasks, which often require multi-task learning capabilities. Previous approaches, although beneficial, fall short in real-world applications because they necessitate task-specific annotations at inference time, limiting broader generalization. This paper introduces MING-MOE, a novel Mixture-of-Experts (MOE)-based medical large language model designed to manage diverse and complex medical tasks without requiring task-specific annotations, thus enhancing its usability across extensive datasets. MING-MOE employs a Mixture of Low-Rank Adaptation (MoLoRA) technique, allowing for efficient parameter usage by…
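
The abstract is cut off before it describes how MoLoRA works, so the following is only a minimal sketch of the general idea of a sparse mixture of low-rank adapter experts, not the authors' implementation. It assumes a frozen base weight, a per-input linear router that keeps the top-k experts and renormalizes their gates with a softmax, and standard LoRA conventions (zero-initialized B matrices, an alpha/rank scaling factor). The class name `MoLoRALinear`, its parameters, and the routing rule are all hypothetical choices made for illustration. Because the router reads only the input vector, no task label is needed at inference time.

```python
import math
import random


def matvec(m, v):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, v)) for row in m]


def softmax(z):
    """Numerically stable softmax over a list of floats."""
    top = max(z)
    e = [math.exp(t - top) for t in z]
    s = sum(e)
    return [t / s for t in e]


class MoLoRALinear:
    """Hypothetical linear layer: frozen base weight plus top-k routed LoRA experts."""

    def __init__(self, d_in, d_out, n_experts=4, rank=2, top_k=2, alpha=4.0, seed=0):
        rng = random.Random(seed)

        def rand(rows, cols):
            return [[rng.gauss(0.0, 0.1) for _ in range(cols)] for _ in range(rows)]

        self.W = rand(d_out, d_in)            # frozen base weight (assumption)
        self.router = rand(n_experts, d_in)   # trainable router (assumption)
        # One low-rank pair (A_i, B_i) per expert; B starts at zero as in
        # standard LoRA, so the layer initially matches the base model.
        self.A = [rand(rank, d_in) for _ in range(n_experts)]
        self.B = [[[0.0] * rank for _ in range(d_out)] for _ in range(n_experts)]
        self.top_k = top_k
        self.scale = alpha / rank

    def route(self, x):
        """Return (expert_index, gate) pairs for the top-k experts of input x."""
        logits = matvec(self.router, x)
        order = sorted(range(len(logits)), key=lambda i: logits[i], reverse=True)
        chosen = order[: self.top_k]
        gates = softmax([logits[i] for i in chosen])
        return list(zip(chosen, gates))

    def forward(self, x):
        """Base output plus the gate-weighted low-rank updates of selected experts."""
        y = matvec(self.W, x)
        for i, gate in self.route(x):
            delta = matvec(self.B[i], matvec(self.A[i], x))
            y = [yi + gate * self.scale * di for yi, di in zip(y, delta)]
        return y


layer = MoLoRALinear(d_in=8, d_out=4)
x = [0.5, -1.0, 0.25, 0.0, 1.5, -0.75, 0.1, 2.0]
print(layer.route(x))    # which experts were selected, with their gate weights
print(layer.forward(x))  # layer output for this input
```

In this sketch only `top_k` of the `n_experts` adapters are evaluated per input, which is where the sparsity comes from; each adapter adds `rank * (d_in + d_out)` parameters rather than a full `d_in * d_out` weight matrix.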

Taxonomy

Topics: Machine Learning in Healthcare · Topic Modeling · Artificial Intelligence in Healthcare and Education

Methods: Sparse Evolutionary Training · Balanced Selection