Model ensemble instead of prompt fusion: a sample-specific knowledge   transfer method for few-shot prompt tuning

Xiangyu Peng; Chen Xing; Prafulla Kumar Choubey; Chien-Sheng Wu,; Caiming Xiong

arXiv:2210.12587·cs.CL·March 2, 2023·5 cites

Model ensemble instead of prompt fusion: a sample-specific knowledge transfer method for few-shot prompt tuning

Xiangyu Peng, Chen Xing, Prafulla Kumar Choubey, Chien-Sheng Wu,, Caiming Xiong

PDF

Open Access 1 Video

TL;DR

This paper introduces SESoM, a sample-specific ensemble method for prompt tuning that improves few-shot learning performance by adaptively combining source model predictions, outperforming existing prompt fusion techniques.

Contribution

Proposes SESoM, a novel sample-specific ensemble approach for prompt transfer, enhancing few-shot prompt tuning performance over traditional prompt fusion methods.

Findings

01

SESoM outperforms existing prompt fusion methods in few-shot tasks.

02

Ensemble of source models improves generalization in low-data regimes.

03

Method is effective across multiple NLP tasks and model scales.

Abstract

Prompt tuning approaches, which learn task-specific soft prompts for a downstream task conditioning on frozen pre-trained models, have attracted growing interest due to its parameter efficiency. With large language models and sufficient training data, prompt tuning performs comparably to full-model tuning. However, with limited training samples in few-shot settings, prompt tuning fails to match the performance of full-model fine-tuning. In this work, we focus on improving the few-shot performance of prompt tuning by transferring knowledge from soft prompts of source tasks. Recognizing the good generalization capabilities of ensemble methods in low-data regime, we first experiment and show that a simple ensemble of model predictions based on different source prompts, outperforms existing multi-prompt knowledge transfer approaches such as source prompt fusion in the few-shot setting.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Model ensemble instead of prompt fusion: a sample-specific knowledge transfer method for few-shot prompt tuning· slideslive

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning · Machine Learning and Data Classification