TL;DR
This paper introduces PERO, a method that optimizes the order of examples in prompts for few-shot learning, significantly improving generalization with minimal data across NLP tasks.
Contribution
The paper proposes PERO, a novel approach that searches for optimal example orderings in prompts, enhancing few-shot learning performance without model fine-tuning.
Findings
PERO outperforms existing methods with as few as 10 examples.
Learning a new separator token can improve prompt effectiveness.
Order of examples in prompts critically affects model generalization.
Abstract
The ability to learn from limited data, or few-shot learning, is a desirable and often critical requirement for NLP systems. While many existing methods do poorly at learning from a handful of examples, large pretrained language models have recently been shown to be efficient few-shot learners. One approach to few-shot learning, which does not require finetuning of model parameters, is to augment the language model's input with priming text which is typically constructed using task specific descriptions and examples. In this work, we further explore priming-based few-shot learning, with focus on using examples as prompts. We show that presenting examples in the right order is key for generalization. We introduce PERO (Prompting with Examples in the Right Order), where we formulate few-shot learning as search over the set of permutations of the training examples. We show that PERO can…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
