TL;DR
NeocorRAG introduces a retrieval quality optimization framework for RAG that enhances reasoning accuracy by mining and utilizing Evidence Chains, achieving state-of-the-art results efficiently.
Contribution
The paper proposes a novel retrieval quality optimization method using Evidence Chains and an activated search algorithm, significantly improving RAG performance.
Findings
NeocorRAG achieves SOTA performance on multiple benchmarks.
It uses less than 20% of tokens compared to comparable methods.
It effectively balances retrieval recall and quality optimization.
Abstract
Although precise recall is a core objective in Retrieval-Augmented Generation (RAG), a critical oversight persists in the field: improvements in retrieval performance do not consistently translate to commensurate gains in downstream reasoning. To diagnose this gap, we propose the Recall Conversion Rate (RCR), a novel evaluation metric to quantify the contribution of retrieval to reasoning accuracy. Our quantitative analysis of mainstream RAG methods reveals that as Recall@5 improves, the RCR exhibits a near-linear decay. We identify the neglect of retrieval quality in these methods as the underlying cause. In contrast, approaches that focus solely on quality optimization often suffer from inferior recall performance. Both categories lack a comprehensive understanding of retrieval quality optimization, resulting in a trade-off dilemma. To address these challenges, we propose…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
