PEML: Parameter-efficient Multi-Task Learning with Optimized Continuous Prompts
Anjir Ahmed Chowdhury, Syed Zawad, Xiaolong Ma, Xu Dong, Feng Yan

TL;DR
PEML introduces a parameter-efficient multi-task learning framework that co-optimizes continuous prompts and model weights, achieving significant accuracy improvements across multiple benchmarks.
Contribution
It proposes a novel neural architecture engineering method for joint prompt optimization and model adaptation in multi-task learning.
Findings
Achieves up to 6.67% average accuracy improvement over state-of-the-art methods.
Demonstrates peak gains of up to 10.75% on individual tasks.
Validates effectiveness across multiple benchmarks including GLUE and SuperGLUE.
Abstract
Parameter-Efficient Fine-Tuning (PEFT) is widely used for adapting Large Language Models (LLMs) for various tasks. Recently, there has been an increasing demand for fine-tuning a single LLM for multiple tasks because it requires overall less data for fine-tuning thanks to the common features shared among tasks. More importantly, LLMs are resource demanding and deploying a single model for multiple tasks facilitates resource consolidation and consumes significantly less resources compared to deploying individual large model for each task. Existing PEFT methods like LoRA and Prefix Tuning are designed to adapt LLMs to a specific task. LoRA and its variation focus on aligning the model itself for tasks, overlooking the importance of prompt tuning in multi-task learning while Prefix Tuning only adopts a simple architecture to optimize prompts, which limits the adaption capabilities for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
