ConvSearch-R1: Enhancing Query Reformulation for Conversational Search with Reasoning via Reinforcement Learning

Changtai Zhu; Siyin Wang; Ruijun Feng; Kai Song; Xipeng Qiu

arXiv:2505.15776·cs.CL·September 16, 2025

ConvSearch-R1: Enhancing Query Reformulation for Conversational Search with Reasoning via Reinforcement Learning

Changtai Zhu, Siyin Wang, Ruijun Feng, Kai Song, Xipeng Qiu

PDF

Open Access 1 Repo 5 Models 4 Datasets 1 Video

TL;DR

ConvSearch-R1 introduces a reinforcement learning-based framework for conversational query reformulation that eliminates the need for external supervision, significantly improving retrieval performance on benchmark datasets.

Contribution

It is the first self-driven reformulation framework that directly optimizes retrieval signals without external annotations, using a novel two-stage reinforcement learning approach.

Findings

01

Over 10% improvement on TopiOCQA dataset

02

Effective reformulation with smaller models (3B parameters)

03

Outperforms previous state-of-the-art methods

Abstract

Conversational search systems require effective handling of context-dependent queries that often contain ambiguity, omission, and coreference. Conversational Query Reformulation (CQR) addresses this challenge by transforming these queries into self-contained forms suitable for off-the-shelf retrievers. However, existing CQR approaches suffer from two critical constraints: high dependency on costly external supervision from human annotations or large language models, and insufficient alignment between the rewriting model and downstream retrievers. We present ConvSearch-R1, the first self-driven framework that completely eliminates dependency on external rewrite supervision by leveraging reinforcement learning to optimize reformulation directly through retrieval signals. Our novel two-stage approach combines Self-Driven Policy Warm-Up to address the cold-start problem through…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

BeastyZ/ConvSearch-R1
pytorchOfficial

Models

Datasets

Videos

ConvSearch-R1: Enhancing Query Reformulation for Conversational Search with Reasoning via Reinforcement Learning· underline

Taxonomy

TopicsSpeech and dialogue systems · Semantic Web and Ontologies · Data Management and Algorithms