More is not always better? Enhancing Many-Shot In-Context Learning with Differentiated and Reweighting Objectives

Xiaoqing Zhang; Ang Lv; Yuhan Liu; Flood Sung; Wei Liu; Jian Luan; Shuo Shang; Xiuying Chen; Rui Yan

arXiv:2501.04070·cs.LG·May 28, 2025

More is not always better? Enhancing Many-Shot In-Context Learning with Differentiated and Reweighting Objectives

Xiaoqing Zhang, Ang Lv, Yuhan Liu, Flood Sung, Wei Liu, Jian Luan, Shuo Shang, Xiuying Chen, Rui Yan

PDF

Open Access 1 Video

TL;DR

This paper introduces DrICL, a novel optimization method with differentiated and reweighting objectives, to improve many-shot in-context learning performance of large language models, addressing data noise and optimization issues.

Contribution

It proposes DrICL, an innovative optimization approach, and develops ICL-50, a large-scale benchmark for evaluating many-shot in-context learning.

Findings

01

DrICL significantly improves many-shot ICL performance across diverse tasks.

02

The new benchmark ICL-50 enables comprehensive evaluation of in-context learning methods.

03

Experimental results show enhanced models outperform baseline approaches in both in-domain and out-of-domain scenarios.

Abstract

Large language models (LLMs) excel at few-shot in-context learning (ICL) without requiring parameter updates. However, as ICL demonstrations increase from a few to many, performance tends to plateau and eventually decline. We identify two primary causes for this trend: the suboptimal negative log-likelihood (NLL) optimization objective and the incremental data noise. To address these issues, we introduce \textit{DrICL}, a novel optimization method that enhances model performance through \textit{Differentiated} and \textit{Reweighting} objectives. Globally, DrICL utilizes differentiated learning to optimize the NLL objective, ensuring that many-shot performance surpasses zero-shot levels. Locally, it dynamically adjusts the weighting of many-shot demonstrations by leveraging cumulative advantages inspired by reinforcement learning, thereby mitigating the impact of noisy data. Recognizing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

More is not always better? Enhancing Many-Shot In-Context Learning with Differentiated and Reweighting Objectives· underline

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning