SPoT: Better Frozen Model Adaptation through Soft Prompt Transfer

Tu Vu; Brian Lester; Noah Constant; Rami Al-Rfou; Daniel Cer

arXiv:2110.07904·cs.CL·March 18, 2022

SPoT: Better Frozen Model Adaptation through Soft Prompt Transfer

Tu Vu, Brian Lester, Noah Constant, Rami Al-Rfou, Daniel Cer

PDF

Open Access

TL;DR

SPoT introduces a prompt transfer method that enhances frozen pre-trained language models' performance on various NLP tasks, matching or surpassing full fine-tuning with significantly fewer parameters.

Contribution

The paper presents SPoT, a novel soft prompt transfer technique that improves task adaptation efficiency and effectiveness in frozen models, outperforming standard fine-tuning on SuperGLUE.

Findings

01

SPoT significantly boosts prompt tuning performance.

02

SPoT matches or exceeds model tuning on SuperGLUE.

03

Many tasks benefit from prompt transfer across diverse NLP tasks.

Abstract

There has been growing interest in parameter-efficient methods to apply pre-trained language models to downstream tasks. Building on the Prompt Tuning approach of Lester et al. (2021), which learns task-specific soft prompts to condition a frozen pre-trained model to perform different tasks, we propose a novel prompt-based transfer learning approach called SPoT: Soft Prompt Transfer. SPoT first learns a prompt on one or more source tasks and then uses it to initialize the prompt for a target task. We show that SPoT significantly boosts the performance of Prompt Tuning across many tasks. More remarkably, across all model sizes, SPoT matches or outperforms standard Model Tuning (which fine-tunes all model parameters) on the SuperGLUE benchmark, while using up to 27,000x fewer task-specific parameters. To understand where SPoT is most effective, we conduct a large-scale study on task…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Domain Adaptation and Few-Shot Learning · Artificial Intelligence in Healthcare and Education