From Few to Many: Self-Improving Many-Shot Reasoners Through Iterative   Optimization and Generation

Xingchen Wan; Han Zhou; Ruoxi Sun; Hootan Nakhost; Ke Jiang; Sercan; \"O. Ar{\i}k

arXiv:2502.00330·cs.LG·February 4, 2025

From Few to Many: Self-Improving Many-Shot Reasoners Through Iterative Optimization and Generation

Xingchen Wan, Han Zhou, Ruoxi Sun, Hootan Nakhost, Ke Jiang, Sercan, \"O. Ar{\i}k

PDF

Open Access

TL;DR

This paper introduces BRIDGE, a method that iteratively identifies influential examples and generates new ones to improve many-shot in-context learning performance in large language models.

Contribution

It presents a novel algorithm combining Bayesian optimization and example generation to enhance many-shot ICL beyond simple scaling.

Findings

01

BRIDGE significantly improves performance across multiple tasks.

02

Influential examples play a key role in ICL effectiveness.

03

The method works on various LLMs of different sizes.

Abstract

Recent advances in long-context large language models (LLMs) have led to the emerging paradigm of many-shot in-context learning (ICL), where it is observed that scaling many more demonstrating examples beyond the conventional few-shot setup in the context can lead to performance benefits. However, despite its promise, it is unclear what aspects dominate the benefits and whether simply scaling to more examples is the most effective way of improving many-shot ICL. In this work, we first provide an analysis of the factors driving many-shot ICL, and we find that 1) many-shot performance can still be attributed to often a few disproportionately influential examples and 2) identifying such influential examples ("optimize") and using them as demonstrations to regenerate new examples ("generate") can lead to further improvements. Inspired by the findings, we propose BRIDGE, an algorithm that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Modeling and Causal Inference