Making Pre-trained Language Models End-to-end Few-shot Learners with   Contrastive Prompt Tuning

Ziyun Xu; Chengyu Wang; Minghui Qiu; Fuli Luo; Runxin Xu; Songfang; Huang; Jun Huang

arXiv:2204.00166·cs.CL·April 4, 2022·1 cites

Making Pre-trained Language Models End-to-end Few-shot Learners with Contrastive Prompt Tuning

Ziyun Xu, Chengyu Wang, Minghui Qiu, Fuli Luo, Runxin Xu, Songfang, Huang, Jun Huang

PDF

Open Access 1 Repo

TL;DR

This paper introduces CP-Tuning, an end-to-end contrastive prompt tuning method that enables pre-trained language models to perform few-shot learning without manual prompt engineering, improving task-invariance and class distinction.

Contribution

It presents the first fully trainable, verbalizer-free contrastive prompt tuning framework with task-invariant continuous prompts and cost-sensitive contrastive learning.

Findings

01

CP-Tuning outperforms state-of-the-art methods on various language understanding tasks.

02

It effectively eliminates the need for manual prompt engineering.

03

The approach enhances class distinction and decision boundary smoothness.

Abstract

Pre-trained Language Models (PLMs) have achieved remarkable performance for various language understanding tasks in IR systems, which require the fine-tuning process based on labeled training data. For low-resource scenarios, prompt-based learning for PLMs exploits prompts as task guidance and turns downstream tasks into masked language problems for effective few-shot fine-tuning. In most existing approaches, the high performance of prompt-based learning heavily relies on handcrafted prompts and verbalizers, which may limit the application of such approaches in real-world scenarios. To solve this issue, we present CP-Tuning, the first end-to-end Contrastive Prompt Tuning framework for fine-tuning PLMs without any manual engineering of task-specific prompts and verbalizers. It is integrated with the task-invariant continuous prompt encoding technique with fully trainable prompt…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

alibaba/EasyNLP/tree/master/examples/fewshot_learning
jaxOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications

MethodsContrastive Learning