PTP: Boosting Stability and Performance of Prompt Tuning with   Perturbation-Based Regularizer

Lichang Chen; Heng Huang; Minhao Cheng

arXiv:2305.02423·cs.CL·May 5, 2023·1 cites

PTP: Boosting Stability and Performance of Prompt Tuning with Perturbation-Based Regularizer

Lichang Chen, Heng Huang, Minhao Cheng

PDF

Open Access

TL;DR

This paper introduces PTP, a perturbation-based regularizer that stabilizes prompt tuning training and enhances performance on NLP benchmarks by smoothing the loss landscape.

Contribution

The paper proposes a novel perturbation-based regularizer for prompt tuning, significantly improving stability and performance over existing methods.

Findings

01

Reduces training instability in prompt tuning.

02

Improves performance on SuperGLUE and FewGLUE benchmarks.

03

Effectively smooths the loss landscape with perturbations.

Abstract

Recent studies show that prompt tuning can better leverage the power of large language models than fine-tuning on downstream natural language understanding tasks. However, the existing prompt tuning methods have training instability issues, as the variance of scores under different random seeds is quite large. To address this critical problem, we first investigate and find that the loss landscape of vanilla prompt tuning is precipitous when it is visualized, where a slight change of input data can cause a big fluctuation in the loss landscape. This is an essential factor that leads to the instability of prompt tuning. Based on this observation, we introduce perturbation-based regularizers, which can smooth the loss landscape, into prompt tuning. We propose a new algorithm, called Prompt Tuning with Perturbation-based regularizer~(PTP), which can not only alleviate training instability…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications