Easier to Judge than to Find: Predicting In-Context Learning Success for Demonstration Selection
Haochun Wang,Chaofen Yang,Jiatong Liu,Jingbo Wang,Zewen Qiang,Sendong Zhao,Bing Qin,Ting Liu

TL;DR
This paper introduces DiSP, a framework for predicting the success of demonstration-context pairs in in-context learning, improving selection efficiency and speed.
Contribution
DiSP offers a novel sample-and-judge approach that predicts demonstration success, enabling faster and more effective demonstration selection in ICL.
Findings
DiSP improves average accuracy by up to 3.4% over baselines.
Achieves up to 23x speedup in end-to-end inference.
Effective across multiple classification datasets with large language models.
Abstract
In-context learning (ICL) is highly sensitive to which demonstrations appear in the prompt, but selecting them is expensive because the space of possible demonstration contexts and combinations is enormous. We argue that demonstration selection is \emph{easier to judge than to find}: predicting whether a specific query--context pair will succeed is cheaper and more general than searching for an optimal . Based on this insight, we propose DiSP, a sample-and-judge framework that stratifies queries by difficulty. DiSP runs random demonstration trials to estimate success rate of each training query, trains a lightweight router to predict difficulty from the query, and trains level-specific judges for sampled demonstrations. At inference, DiSP performs stop-on-acceptance judging under an explicit budget, emitting diagnostic risk tags when no suitable context is found. Across…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
