ChatRetriever: Adapting Large Language Models for Generalized and Robust Conversational Dense Retrieval
Kelong Mao, Chenlong Deng, Haonan Chen, Fengran Mo, Zheng Liu, Tetsuya, Sakai, Zhicheng Dou

TL;DR
ChatRetriever leverages large language models with contrastive learning and instruction tuning to improve conversational dense retrieval, achieving state-of-the-art results and robustness across multiple benchmarks.
Contribution
The paper introduces a simple dual-learning approach with masked instruction tuning to adapt LLMs for conversational retrieval, enhancing session understanding and robustness.
Findings
Outperforms existing conversational dense retrievers on five benchmarks.
Achieves state-of-the-art performance comparable to LLM-based rewriting methods.
Demonstrates superior robustness in diverse conversational contexts.
Abstract
Conversational search requires accurate interpretation of user intent from complex multi-turn contexts. This paper presents ChatRetriever, which inherits the strong generalization capability of large language models to robustly represent complex conversational sessions for dense retrieval. To achieve this, we propose a simple and effective dual-learning approach that adapts LLM for retrieval via contrastive learning while enhancing the complex session understanding through masked instruction tuning on high-quality conversational instruction tuning data. Extensive experiments on five conversational search benchmarks demonstrate that ChatRetriever substantially outperforms existing conversational dense retrievers, achieving state-of-the-art performance on par with LLM-based rewriting approaches. Furthermore, ChatRetriever exhibits superior robustness in handling diverse conversational…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques
MethodsContrastive Learning
