Active Example Selection for In-Context Learning

Yiming Zhang; Shi Feng; Chenhao Tan

arXiv:2211.04486 · cs.CL · November 10, 2022 · 5 citations

TL;DR

This paper introduces a reinforcement learning approach to select demonstration examples for in-context learning, improving performance on smaller models and revealing limitations on larger models.

Contribution

The paper formulates example selection as a sequential decision problem and develops an RL-based method to identify effective demonstration examples for in-context learning.
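To make the sequential framing concrete, here is a minimal sketch, not the authors' implementation. It assumes the state is the list of demonstrations chosen so far, an action adds one unused example from a candidate pool, and the reward is the change in a prompt score. The names `SelectionEpisode`, `greedy_rollout`, and `toy_score` are hypothetical. `toy_score` is a stand-in for evaluating a language model on held-out data, and the greedy rule is a stand-in for a learned policy.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Sequence, Tuple

Example = Tuple[str, str]  # (input text, label)


@dataclass
class SelectionEpisode:
    """One episode of picking demonstrations, one example per step (hypothetical)."""
    pool: Sequence[Example]
    max_examples: int
    score_fn: Callable[[List[Example]], float]
    chosen: List[int] = field(default_factory=list)

    def prompt(self) -> List[Example]:
        # State: the demonstrations selected so far, in selection order.
        return [self.pool[i] for i in self.chosen]

    def actions(self) -> List[int]:
        # Actions: indices of pool examples not yet selected.
        return [i for i in range(len(self.pool)) if i not in self.chosen]

    def done(self) -> bool:
        return len(self.chosen) >= self.max_examples

    def step(self, action: int) -> float:
        # Reward (an assumption): change in score from adding the example.
        before = self.score_fn(self.prompt())
        self.chosen.append(action)
        return self.score_fn(self.prompt()) - before


def greedy_rollout(episode: SelectionEpisode) -> List[Example]:
    """Stand-in policy: pick the action with the best immediate score."""
    while not episode.done():
        best = max(
            episode.actions(),
            key=lambda a: episode.score_fn(episode.prompt() + [episode.pool[a]]),
        )
        episode.step(best)
    return episode.prompt()


def toy_score(demos: List[Example]) -> float:
    # Toy stand-in for validation accuracy: fraction of the two labels covered.
    return len({label for _, label in demos}) / 2


pool = [("great", "pos"), ("fine", "pos"), ("awful", "neg"), ("bad", "neg")]
demos = greedy_rollout(SelectionEpisode(pool, max_examples=2, score_fn=toy_score))
print(demos)
```

In practice, `score_fn` would run the language model with the chosen demonstrations on a validation set, and the greedy rule would be replaced by a policy trained with reinforcement learning.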

Findings

01. RL-based policies improve GPT-2 performance by 5.8% on average

02. Selected examples enhance GPT-3 Ada performance slightly

03. Larger GPT-3 models show diminishing benefits from example selection

Abstract

With a handful of demonstration examples, large-scale language models show strong capability to perform various tasks by in-context learning from these examples, without any fine-tuning. We demonstrate that in-context learning performance can be highly unstable across samples of examples, indicating the idiosyncrasies of how language models acquire information. We formulate example selection for in-context learning as a sequential decision problem, and propose a reinforcement learning algorithm for identifying generalizable policies to select demonstration examples. For GPT-2, our learned policies demonstrate strong abilities of generalizing to unseen tasks in training, with a 5.8% improvement on average. Examples selected from our learned policies can even achieve a small improvement on GPT-3 Ada. However, the improvement diminishes on larger GPT-3 models, suggesting emerging…

Code & Models

Repositories

chicagohai/active-example-selection (PyTorch, official)

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Speech and dialogue systems

Methods: Attention Is All You Need · Cosine Annealing · Linear Warmup With Cosine Annealing · Attention Dropout · Dense Connections · Softmax · Linear Layer