Zero-Shot Continuous Prompt Transfer: Generalizing Task Semantics Across   Language Models

Zijun Wu; Yongkang Wu; Lili Mou

arXiv:2310.01691·cs.CL·July 15, 2024·1 cites

Zero-Shot Continuous Prompt Transfer: Generalizing Task Semantics Across Language Models

Zijun Wu, Yongkang Wu, Lili Mou

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a zero-shot method for transferring continuous prompts across different language models by encoding prompts into a relative space, enhancing cross-model generalization of task semantics.

Contribution

The work presents a novel zero-shot transfer technique for continuous prompts that encodes task semantics into a relative space, enabling effective transfer across diverse models.

Findings

01

Transferability of prompts improves across models

02

Combining multiple source prompts enhances generalization

03

Method confirms effectiveness through experimental results

Abstract

Prompt tuning in natural language processing (NLP) has become an increasingly popular method for adapting large language models to specific tasks. However, the transferability of these prompts, especially continuous prompts, between different models remains a challenge. In this work, we propose a zero-shot continuous prompt transfer method, where source prompts are encoded into relative space and the corresponding target prompts are searched for transferring to target models. Experimental results confirm the effectiveness of our method, showing that 'task semantics' in continuous prompts can be generalized across various language models. Moreover, we find that combining 'task semantics' from multiple source models can further enhance the generalizability of transfer.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

manga-uofa/ptfer
pytorchOfficial

Videos

Zero-Shot Continuous Prompt Transfer: Generalizing Task Semantics Across Language Models· slideslive

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications