Efficient Context Selection for Long-Context QA: No Tuning, No Iteration, Just Adaptive-$k$

Chihiro Taguchi; Seiji Maekawa; Nikita Bhutani

arXiv:2506.08479·cs.CL·October 1, 2025

Efficient Context Selection for Long-Context QA: No Tuning, No Iteration, Just Adaptive-$k$

Chihiro Taguchi, Seiji Maekawa, Nikita Bhutani

PDF

1 Video

TL;DR

This paper introduces Adaptive-$k$, a simple, single-pass method for dynamically selecting the optimal number of context passages in long-context QA, improving efficiency and accuracy without model tuning or iterative prompting.

Contribution

Adaptive-$k$ is a novel, non-iterative approach that adaptively determines context size based on similarity scores, outperforming fixed-k methods in both factoid and aggregation QA tasks.

Findings

01

Matches or exceeds fixed-$k$ baselines in accuracy.

02

Uses up to 10x fewer tokens than full-context input.

03

Retrieves 70% of relevant passages.

Abstract

Retrieval-augmented generation (RAG) and long-context language models (LCLMs) both address context limitations of LLMs in open-domain question answering (QA). However, optimal external context to retrieve remains an open problem: fixing the retrieval size risks either wasting tokens or omitting key evidence. Existing adaptive methods like Self-RAG and Self-Route rely on iterative LLM prompting and perform well on factoid QA, but struggle with aggregation QA, where the optimal context size is both unknown and variable. We present Adaptive- $k$ retrieval, a simple and effective single-pass method that adaptively selects the number of passages based on the distribution of the similarity scores between the query and the candidate passages. It does not require model fine-tuning, extra LLM inferences or changes to existing retriever-reader pipelines. On both factoid and aggregation QA…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Efficient Context Selection for Long-Context QA: No Tuning, No Iteration, Just Adaptive‑k· underline