Sweeping Heterogeneity with Smart MoPs: Mixture of Prompts for LLM Task   Adaptation

Chen Dun; Mirian Hipolito Garcia; Guoqing Zheng; Ahmed Hassan; Awadallah; Anastasios Kyrillidis; Robert Sim

arXiv:2310.02842·cs.CL·January 22, 2025·2 cites

Sweeping Heterogeneity with Smart MoPs: Mixture of Prompts for LLM Task Adaptation

Chen Dun, Mirian Hipolito Garcia, Guoqing Zheng, Ahmed Hassan, Awadallah, Anastasios Kyrillidis, Robert Sim

PDF

Open Access

TL;DR

This paper introduces Mixture of Prompts (MoPs) with smart gating to adapt large language models to heterogeneous tasks and data, improving performance and efficiency across various scenarios.

Contribution

The paper proposes a novel MoPs framework with a dynamic gating mechanism for multi-task prompt adaptation, addressing task heterogeneity and model compression effects.

Findings

01

MoPs reduce perplexity by up to 70% in federated scenarios.

02

MoPs improve performance by 3-30% in centralized settings.

03

The gating mechanism effectively identifies relevant skills for diverse tasks.

Abstract

Large Language Models (LLMs) have the ability to solve a variety of tasks, such as text summarization and mathematical questions, just out of the box, but they are often trained with a single task in mind. Due to high computational costs, the current trend is to use prompt instruction tuning to better adjust monolithic, pretrained LLMs for new -- but often individual -- downstream tasks. Thus, how one would expand prompt tuning to handle -- concomitantly -- heterogeneous tasks and data distributions is a widely open question. To address this gap, we suggest the use of \emph{Mixture of Prompts}, or MoPs, associated with smart gating functionality: the latter -- whose design is one of the contributions of this paper -- can identify relevant skills embedded in different groups of prompts and dynamically assign combined experts (i.e., collection of prompts), based on the target task.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis