Sweeping Heterogeneity with Smart MoPs: Mixture of Prompts for LLM Task Adaptation
Chen Dun, Mirian Hipolito Garcia, Guoqing Zheng, Ahmed Hassan, Awadallah, Anastasios Kyrillidis, Robert Sim

TL;DR
This paper introduces Mixture of Prompts (MoPs) with smart gating to adapt large language models to heterogeneous tasks and data, improving performance and efficiency across various scenarios.
Contribution
The paper proposes a novel MoPs framework with a dynamic gating mechanism for multi-task prompt adaptation, addressing task heterogeneity and model compression effects.
Findings
MoPs reduce perplexity by up to 70% in federated scenarios.
MoPs improve performance by 3-30% in centralized settings.
The gating mechanism effectively identifies relevant skills for diverse tasks.
Abstract
Large Language Models (LLMs) have the ability to solve a variety of tasks, such as text summarization and mathematical questions, just out of the box, but they are often trained with a single task in mind. Due to high computational costs, the current trend is to use prompt instruction tuning to better adjust monolithic, pretrained LLMs for new -- but often individual -- downstream tasks. Thus, how one would expand prompt tuning to handle -- concomitantly -- heterogeneous tasks and data distributions is a widely open question. To address this gap, we suggest the use of \emph{Mixture of Prompts}, or MoPs, associated with smart gating functionality: the latter -- whose design is one of the contributions of this paper -- can identify relevant skills embedded in different groups of prompts and dynamically assign combined experts (i.e., collection of prompts), based on the target task.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis
