AutoPrompt: Eliciting Knowledge from Language Models with Automatically   Generated Prompts

Taylor Shin; Yasaman Razeghi; Robert L. Logan IV; Eric Wallace; Sameer; Singh

arXiv:2010.15980·cs.CL·November 10, 2020·66 cites

AutoPrompt: Eliciting Knowledge from Language Models with Automatically Generated Prompts

Taylor Shin, Yasaman Razeghi, Robert L. Logan IV, Eric Wallace, Sameer, Singh

PDF

Open Access 5 Repos

TL;DR

AutoPrompt introduces an automated, gradient-guided method for creating prompts that effectively elicit knowledge from language models, enabling tasks like sentiment analysis and relation extraction without additional training.

Contribution

It presents AutoPrompt, a novel automated prompt generation technique that enhances the probing of language models' knowledge without finetuning or manual prompt crafting.

Findings

01

AutoPrompt achieves competitive performance on sentiment analysis and natural language inference.

02

It elicits more accurate factual knowledge than manual prompts on the LAMA benchmark.

03

Language models can perform relation extraction more effectively with AutoPrompt than supervised models.

Abstract

The remarkable success of pretrained language models has motivated the study of what kinds of knowledge these models learn during pretraining. Reformulating tasks as fill-in-the-blanks problems (e.g., cloze tests) is a natural approach for gauging such knowledge, however, its usage is limited by the manual effort and guesswork required to write suitable prompts. To address this, we develop AutoPrompt, an automated method to create prompts for a diverse set of tasks, based on a gradient-guided search. Using AutoPrompt, we show that masked language models (MLMs) have an inherent capability to perform sentiment analysis and natural language inference without additional parameters or finetuning, sometimes achieving performance on par with recent state-of-the-art supervised models. We also show that our prompts elicit more accurate factual knowledge from MLMs than the manually created…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Sentiment Analysis and Opinion Mining

MethodsSoftmax · Tanh Activation · Low-Rank Factorization-based Multi-Head Attention