CAR: Query-Guided Confidence-Aware Reranking for Retrieval-Augmented Generation
Zhipeng Song, Yizhi Zhou, Xiangyu Kong, Jiulong Jiao, Xuezhou Ye, Chunqi Gao, Xueqing Shi, Yuhang Zhou, Heng Qi

TL;DR
CAR is a novel reranking framework that uses generator confidence changes to improve document selection for retrieval-augmented generation, enhancing answer quality across various datasets and models.
Contribution
It introduces a training-free, query-guided reranking method that leverages generator confidence to better select useful documents for RAG tasks.
Findings
CAR improves NDCG@5 across multiple datasets and retrievers.
It enhances the YesNo reranker by 25.4% on average with Contriever.
Ranking gains from CAR strongly correlate with downstream F1 improvements.
Abstract
Retrieval-Augmented Generation (RAG) depends on document ranking to provide useful evidence for generation, but conventional reranking methods mainly optimize query-document relevance rather than generation usefulness. A relevant document may still introduce noise, while a lower-ranked document may better reduce the generator's uncertainty. We propose CAR (Confidence-Aware Reranking), a query-guided, training-free, and plug-and-play reranking framework that uses generator confidence change as a document usefulness signal. CAR estimates confidence through the semantic consistency of multiple sampled answers under query-only and query-document conditions. Documents that significantly increase confidence are promoted, those that decrease confidence are demoted, and uncertain cases preserve the baseline order, while a query-level gate avoids unnecessary intervention on already confident…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
