Ask Me Anything: A simple strategy for prompting language models
Simran Arora, Avanika Narayan, Mayee F. Chen, Laurel Orr, Neel Guha,, Kush Bhatia, Ines Chami, Frederic Sala, Christopher R\'e

TL;DR
This paper introduces AMA, a prompting strategy that uses multiple prompts and aggregation to improve language model performance without extensive prompt engineering, achieving results comparable or superior to larger models.
Contribution
The paper proposes AMA, a simple method that transforms inputs into question-answer format and combines multiple prompts using weak supervision, significantly enhancing model performance.
Findings
AMA improves performance by an average of 10.2% over few-shot baselines.
Open-source GPT-J-6B matches or exceeds GPT-3 175B on many benchmarks.
Using multiple prompts and aggregation reduces the need for perfect prompt design.
Abstract
Large language models (LLMs) transfer well to new tasks out-of-the-box simply given a natural language prompt that demonstrates how to perform the task and no additional training. Prompting is a brittle process wherein small modifications to the prompt can cause large variations in the model predictions, and therefore significant effort is dedicated towards designing a painstakingly "perfect prompt" for a task. To mitigate the high degree of effort involved in prompt-design, we instead ask whether producing multiple effective, yet imperfect, prompts and aggregating them can lead to a high quality prompting strategy. Our observations motivate our proposed prompting method, ASK ME ANYTHING (AMA). We first develop an understanding of the effective prompt formats, finding that question-answering (QA) prompts, which encourage open-ended generation ("Who went to the park?") tend to outperform…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Artificial Intelligence in Healthcare and Education
MethodsOPT
