TL;DR
UCS is a training-free method for selecting demonstrations in in-context learning that estimates the coverage of latent clusters to improve model performance.
Contribution
It introduces a novel coverage-based, training-free demonstration selection method using latent clusters and a Smoothed Good–Turing estimator, enhancing ICL accuracy.
Findings
UCS improves ICL accuracy by 2-6% across multiple benchmarks.
UCS provides insights into task- and model-level latent cluster distributions.
Combining UCS with existing baselines consistently enhances performance.
Abstract
In-context learning (ICL) performance depends critically on which demonstrations are placed in the prompt, yet most existing selectors prioritize heuristic notions of relevance or diversity and provide limited insight into the coverage of a demonstration set. We propose Unseen Coverage Selection (UKS), a training-free, subset-level coverage prior motivated by the principle that a good demonstration set should expose the model to latent cluster unrevealed by the currently selected subset. UCS operationalizes this idea by (1) inducing discrete latent clusters from model-consistent embeddings and (2) estimating the number of unrevealed clusters within a candidate subset via a Smoothed Good--Turing estimator from its empirical frequency spectrum. Unlike previous selection methods, UCS is coverage-based and training-free, and can be seamlessly combined with both query-dependent and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
