TL;DR
ConvDR is a novel conversational dense retrieval system that learns contextualized embeddings, performs well in few-shot settings, and effectively captures relevant context while ignoring noise, improving conversational search accuracy.
Contribution
We introduce ConvDR, a few-shot capable conversational dense retrieval model that uses a teacher-student framework to learn contextualized embeddings for multi-turn queries.
Findings
Outperforms previous sparse retrieval systems in conversational search.
Matches oracle reformulation accuracy in retrieval tasks.
More efficient and effective in capturing relevant context.
Abstract
Dense retrieval (DR) has the potential to resolve the query understanding challenge in conversational search by matching in the learned embedding space. However, this adaptation is challenging due to DR models' extra needs for supervision signals and the long-tail nature of conversational search. In this paper, we present a Conversational Dense Retrieval system, ConvDR, that learns contextualized embeddings for multi-turn conversational queries and retrieves documents solely using embedding dot products. In addition, we grant ConvDR few-shot ability using a teacher-student framework, where we employ an ad hoc dense retriever as the teacher, inherit its document encodings, and learn a student query encoder to mimic the teacher embeddings on oracle reformulated queries. Our experiments on TREC CAsT and OR-QuAC demonstrate ConvDR's effectiveness in both few-shot and fully-supervised…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsHigh-Order Consensuses
