Towards Informative Few-Shot Prompt with Maximum Information Gain for   In-Context Learning

Hongfu Liu; Ye Wang

arXiv:2310.08923·cs.CL·October 16, 2023·1 cites

Towards Informative Few-Shot Prompt with Maximum Information Gain for In-Context Learning

Hongfu Liu, Ye Wang

PDF

Open Access

TL;DR

This paper proposes a method to select the most informative examples for in-context learning in large language models by maximizing information gain, leading to more stable and effective few-shot prompts.

Contribution

It introduces a novel approach to quantify and maximize information gain in example selection, and addresses template bias with a calibration strategy.

Findings

01

Achieves 14.3% average relative improvement across six classification tasks

02

Reduces variance in in-context learning performance

03

Enhances stability and fairness in few-shot prompting

Abstract

Large Language models (LLMs) possess the capability to engage In-context Learning (ICL) by leveraging a few demonstrations pertaining to a new downstream task as conditions. However, this particular learning paradigm suffers from high instability stemming from substantial variances induced by factors such as the input distribution of selected examples, their ordering, and prompt formats. In this work, we demonstrate that even when all these factors are held constant, the random selection of examples still results in high variance. Consequently, we aim to explore the informative ability of data examples by quantifying the Information Gain (IG) obtained in prediction after observing a given example candidate. Then we propose to sample those with maximum IG. Additionally, we identify the presence of template bias, which can lead to unfair evaluations of IG during the sampling process. To…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification