TL;DR
RaCoT introduces a retrieval-aware contrastive framework that enhances large language model reasoning by proactively focusing on critical details, improving accuracy, robustness, and efficiency in retrieval-augmented generation tasks.
Contribution
It presents a novel pre-retrieval contrastive question generation method that guides models to better distinguish answer-divergent details, surpassing existing retrieval techniques.
Findings
Outperforms RankRAG and Self-RAG by 0.9-2.4% on key benchmarks.
Demonstrates only 8.6% performance drop under adversarial conditions.
Maintains low latency (3.12s) and token overhead (11.54), achieving a favorable accuracy-efficiency balance.
Abstract
Retrieval-Augmented Generation (RAG) faces a core bottleneck with knowledge-sparse and semantically ambiguous long-tail queries, where retrieval noise distorts reasoning and necessitates costly post-processing. To tackle this, we propose RaCoT (Retrieval-aware Contrastive-of-Thought), a novel framework that shifts contrastive thinking to the pre-retrieval stage. By automatically generating a semantically adjacent yet differently answered contrastive question and extracting a -Prompt to capture their key differences, RaCoT guides the model to proactively focus on the ``critical details that determine answer divergence." This approach allows it to suppress semantic interference within a single retrieval pass, overcoming the theoretical bottleneck of single-vector queries that struggle to simultaneously encode signals for what to attend to and what to ignore. On six authoritative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
