RankCoT: Refining Knowledge for Retrieval-Augmented Generation through Ranking Chain-of-Thoughts
Mingyan Wu, Zhenghao Liu, Yukun Yan, Xinze Li, Shi Yu, Zheni Zeng, Yu, Gu, Ge Yu

TL;DR
RankCoT improves retrieval-augmented generation by refining knowledge through ranking chain-of-thoughts, enabling LLMs to better filter irrelevant information and produce more accurate, concise answers.
Contribution
The paper introduces RankCoT, a novel method that incorporates reranking signals and self-reflection to enhance knowledge refinement in retrieval-augmented generation.
Findings
RankCoT outperforms existing knowledge refinement models in accuracy.
It produces shorter, more effective refinement outputs.
The method enhances LLMs' ability to filter irrelevant documents.
Abstract
Retrieval-Augmented Generation (RAG) enhances the performance of Large Language Models (LLMs) by incorporating external knowledge. However, LLMs still encounter challenges in effectively utilizing the knowledge from retrieved documents, often being misled by irrelevant or noisy information. To address this issue, we introduce RankCoT, a knowledge refinement method that incorporates reranking signals in generating CoT-based summarization for knowledge refinement based on given query and all retrieval documents. During training, RankCoT prompts the LLM to generate Chain-of-Thought (CoT) candidates based on the query and individual documents. It then fine-tunes the LLM to directly reproduce the best CoT from these candidate outputs based on all retrieved documents, which requires LLM to filter out irrelevant documents during generating CoT-style summarization. Additionally, RankCoT…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsRecommender Systems and Techniques · AI in Service Interactions · Semantic Web and Ontologies
