RaFe: Ranking Feedback Improves Query Rewriting for RAG

Shengyu Mao; Yong Jiang; Boli Chen; Xiao Li; Peng Wang; Xinyu Wang,; Pengjun Xie; Fei Huang; Huajun Chen; Ningyu Zhang

arXiv:2405.14431·cs.CL·May 24, 2024·1 cites

RaFe: Ranking Feedback Improves Query Rewriting for RAG

Shengyu Mao, Yong Jiang, Boli Chen, Xiao Li, Peng Wang, Xinyu Wang,, Pengjun Xie, Fei Huang, Huajun Chen, Ningyu Zhang

PDF

Open Access

TL;DR

This paper introduces RaFe, a framework that enhances query rewriting in RAG systems by utilizing ranking feedback from a reranker, eliminating the need for annotated data and improving performance.

Contribution

RaFe is a novel, annotation-free framework that leverages ranking feedback from rerankers to improve query rewriting in RAG systems.

Findings

01

RaFe outperforms baseline methods in query rewriting tasks.

02

Utilizing ranking feedback improves the quality of rewritten queries.

03

The framework reduces reliance on annotated data for training.

Abstract

As Large Language Models (LLMs) and Retrieval Augmentation Generation (RAG) techniques have evolved, query rewriting has been widely incorporated into the RAG system for downstream tasks like open-domain QA. Many works have attempted to utilize small models with reinforcement learning rather than costly LLMs to improve query rewriting. However, current methods require annotations (e.g., labeled relevant documents or downstream answers) or predesigned rewards for feedback, which lack generalization, and fail to utilize signals tailored for query rewriting. In this paper, we propose ours, a framework for training query rewriting models free of annotations. By leveraging a publicly available reranker, ours~provides feedback aligned well with the rewriting objectives. Experimental results demonstrate that ours~can obtain better performance than baselines.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Modeling and Causal Inference

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · WordPiece · Linear Warmup With Linear Decay · Attention Dropout · Linear Layer · Multi-Head Attention · Residual Connection · Weight Decay · Byte Pair Encoding