UniConv: Unifying Retrieval and Response Generation for Large Language Models in Conversations

Fengran Mo; Yifan Gao; Chuan Meng; Xin Liu; Zhuofeng Wu; Kelong Mao; Zhengyang Wang; Pei Chen; Zheng Li; Xian Li; Bing Yin; Meng Jiang

arXiv:2507.07030·cs.CL·July 14, 2025

UniConv: Unifying Retrieval and Response Generation for Large Language Models in Conversations

Fengran Mo, Yifan Gao, Chuan Meng, Xin Liu, Zhuofeng Wu, Kelong Mao, Zhengyang Wang, Pei Chen, Zheng Li, Xian Li, Bing Yin, Meng Jiang

PDF

TL;DR

This paper introduces UniConv, a unified model that combines retrieval and response generation for conversational search, enhancing performance by joint fine-tuning and mechanisms to reduce inconsistency.

Contribution

It proposes a novel unified framework for dense retrieval and response generation in conversational systems, addressing limitations of separate models.

Findings

01

Outperforms existing baselines on five datasets.

02

Mutually improves retrieval and response generation.

03

Reduces inconsistency risks in unified modeling.

Abstract

The rapid advancement of conversational search systems revolutionizes how information is accessed by enabling the multi-turn interaction between the user and the system. Existing conversational search systems are usually built with two different models. This separation restricts the system from leveraging the intrinsic knowledge of the models simultaneously, which cannot ensure the effectiveness of retrieval benefiting the generation. The existing studies for developing unified models cannot fully address the aspects of understanding conversational context, managing retrieval independently, and generating responses. In this paper, we explore how to unify dense retrieval and response generation for large language models in conversation. We conduct joint fine-tuning with different objectives and design two mechanisms to reduce the inconsistency risks while mitigating data discrepancy. The…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.