Scalable Prompt Generation for Semi-supervised Learning with Language   Models

Yuhang Zhou; Suraj Maharjan; Beiye Liu

arXiv:2302.09236·cs.CL·February 21, 2023·1 cites

Scalable Prompt Generation for Semi-supervised Learning with Language Models

Yuhang Zhou, Suraj Maharjan, Beiye Liu

PDF

Open Access

TL;DR

This paper introduces automated methods for prompt and verbalizer design in semi-supervised learning with language models, significantly reducing manual effort while maintaining high performance across NLP tasks.

Contribution

It proposes two automatic prompt generation techniques and a verbalizer method, improving scalability and efficiency in semi-supervised NLP learning.

Findings

01

Achieved 73.2% average accuracy, surpassing previous methods by 2.52%.

02

Automated prompts match or outperform manual prompts in few-shot learning.

03

Methods are effective across multiple NLP datasets and tasks.

Abstract

Prompt-based learning methods in semi-supervised learning (SSL) settings have been shown to be effective on multiple natural language understanding (NLU) datasets and tasks in the literature. However, manually designing multiple prompts and verbalizers requires domain knowledge and human effort, making it difficult and expensive to scale across different datasets. In this paper, we propose two methods to automatically design multiple prompts and integrate automatic verbalizer in SSL settings without sacrificing performance. The first method uses various demonstration examples with learnable continuous prompt tokens to create diverse prompt models. The second method uses a varying number of soft prompt tokens to encourage language models to learn different prompts. For the verbalizer, we use the prototypical verbalizer to replace the manual one. In summary, we obtained the best average…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications