Ask Me Anything: A simple strategy for prompting language models

Simran Arora; Avanika Narayan; Mayee F. Chen; Laurel Orr; Neel Guha,; Kush Bhatia; Ines Chami; Frederic Sala; Christopher R\'e

arXiv:2210.02441·cs.CL·November 22, 2022·74 cites

Ask Me Anything: A simple strategy for prompting language models

Simran Arora, Avanika Narayan, Mayee F. Chen, Laurel Orr, Neel Guha,, Kush Bhatia, Ines Chami, Frederic Sala, Christopher R\'e

PDF

Open Access 3 Repos 2 Videos

TL;DR

This paper introduces AMA, a prompting strategy that uses multiple prompts and aggregation to improve language model performance without extensive prompt engineering, achieving results comparable or superior to larger models.

Contribution

The paper proposes AMA, a simple method that transforms inputs into question-answer format and combines multiple prompts using weak supervision, significantly enhancing model performance.

Findings

01

AMA improves performance by an average of 10.2% over few-shot baselines.

02

Open-source GPT-J-6B matches or exceeds GPT-3 175B on many benchmarks.

03

Using multiple prompts and aggregation reduces the need for perfect prompt design.

Abstract

Large language models (LLMs) transfer well to new tasks out-of-the-box simply given a natural language prompt that demonstrates how to perform the task and no additional training. Prompting is a brittle process wherein small modifications to the prompt can cause large variations in the model predictions, and therefore significant effort is dedicated towards designing a painstakingly "perfect prompt" for a task. To mitigate the high degree of effort involved in prompt-design, we instead ask whether producing multiple effective, yet imperfect, prompts and aggregating them can lead to a high quality prompting strategy. Our observations motivate our proposed prompting method, ASK ME ANYTHING (AMA). We first develop an understanding of the effective prompt formats, finding that question-answering (QA) prompts, which encourage open-ended generation ("Who went to the park?") tend to outperform…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

Ask Me Anything: A Simple Strategy For Prompting Language Models | Paper Explained· youtube

Ask Me Anything: A simple strategy for prompting language models· slideslive

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Artificial Intelligence in Healthcare and Education

MethodsOPT