Less is More: Compact Clue Selection for Efficient Retrieval-Augmented Generation Reasoning
Qianchi Zhang, Hainan Zhang, Liang Pang, Yongxin Tong, Hongwei Zheng, Zhiming Zheng

TL;DR
This paper introduces CompSelect, a compact clue selection method for retrieval-augmented generation that enhances reasoning efficiency by extracting and organizing minimal yet sufficient clues, reducing latency and improving performance.
Contribution
The paper proposes a novel three-component mechanism—clue extractor, reranker, and truncator—for efficient, well-organized clue selection tailored for LLM-centric RAG systems, inspired by Occam's razor.
Findings
Improves QA performance with fewer clues
Reduces reasoning latency significantly
Demonstrates robustness across datasets
Abstract
Current RAG retrievers are designed primarily for human readers, emphasizing complete, readable, and coherent paragraphs. However, Large Language Models (LLMs) benefit more from precise, compact, and well-structured input, which enhances reasoning quality and efficiency. Existing methods rely on reranking or summarization to identify key sentences, but may introduce semantic breaks and unfaithfulness. Thus, efficiently extracting and organizing answer-relevant clues from large-scale documents while reducing LLM reasoning costs remains challenging in RAG systems. Inspired by Occam's razor, we frame LLM-centric retrieval as MinMax optimization: maximizing the extraction of potential clues and reranking them for well-organization, while minimizing reasoning costs by truncating to the smallest sufficient set of clues. In this paper, we propose CompSelect, a compact clue selection mechanism…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Byte Pair Encoding · Adam · Softmax · Dropout · Weight Decay · BART · WordPiece · Layer Normalization
