RankCoT: Refining Knowledge for Retrieval-Augmented Generation through   Ranking Chain-of-Thoughts

Mingyan Wu; Zhenghao Liu; Yukun Yan; Xinze Li; Shi Yu; Zheni Zeng; Yu; Gu; Ge Yu

arXiv:2502.17888·cs.CL·February 26, 2025

RankCoT: Refining Knowledge for Retrieval-Augmented Generation through Ranking Chain-of-Thoughts

Mingyan Wu, Zhenghao Liu, Yukun Yan, Xinze Li, Shi Yu, Zheni Zeng, Yu, Gu, Ge Yu

PDF

Open Access 1 Repo 1 Video

TL;DR

RankCoT improves retrieval-augmented generation by refining knowledge through ranking chain-of-thoughts, enabling LLMs to better filter irrelevant information and produce more accurate, concise answers.

Contribution

The paper introduces RankCoT, a novel method that incorporates reranking signals and self-reflection to enhance knowledge refinement in retrieval-augmented generation.

Findings

01

RankCoT outperforms existing knowledge refinement models in accuracy.

02

It produces shorter, more effective refinement outputs.

03

The method enhances LLMs' ability to filter irrelevant documents.

Abstract

Retrieval-Augmented Generation (RAG) enhances the performance of Large Language Models (LLMs) by incorporating external knowledge. However, LLMs still encounter challenges in effectively utilizing the knowledge from retrieved documents, often being misled by irrelevant or noisy information. To address this issue, we introduce RankCoT, a knowledge refinement method that incorporates reranking signals in generating CoT-based summarization for knowledge refinement based on given query and all retrieval documents. During training, RankCoT prompts the LLM to generate Chain-of-Thought (CoT) candidates based on the query and individual documents. It then fine-tunes the LLM to directly reproduce the best CoT from these candidate outputs based on all retrieved documents, which requires LLM to filter out irrelevant documents during generating CoT-style summarization. Additionally, RankCoT…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

neuir/rankcot
pytorchOfficial

Videos

RankCoT: Refining Knowledge for Retrieval-Augmented Generation through Ranking Chain-of-Thoughts· underline

Taxonomy

TopicsRecommender Systems and Techniques · AI in Service Interactions · Semantic Web and Ontologies