CONVERSER: Few-Shot Conversational Dense Retrieval with Synthetic Data Generation

Chao-Wei Huang; Chen-Yu Hsu; Tsu-Yuan Hsu; Chen-An Li; Yun-Nung Chen

arXiv:2309.06748 · cs.CL · September 14, 2023 · 1 citation

TL;DR

CONVERSER is a framework that leverages large language models to generate synthetic conversational data, enabling effective training of dense retrieval models with minimal in-domain examples, thus reducing data collection costs.

Contribution

The paper introduces a novel few-shot training method for conversational dense retrieval using synthetic data generated by large language models.

Findings

01 Achieves performance comparable to fully-supervised models on the OR-QuAC and TREC CAsT 19 benchmarks

02 Reduces the need for large in-domain conversational datasets

03 Demonstrates the effectiveness of synthetic data in conversational IR

Abstract

Conversational search provides a natural interface for information retrieval (IR). Recent approaches have demonstrated promising results in applying dense retrieval to conversational IR. However, training dense retrievers requires large amounts of in-domain paired data. This hinders the development of conversational dense retrievers, as abundant in-domain conversations are expensive to collect. In this paper, we propose CONVERSER, a framework for training conversational dense retrievers with at most 6 examples of in-domain dialogues. Specifically, we utilize the in-context learning capability of large language models to generate conversational queries given a passage in the retrieval corpus. Experimental results on conversational retrieval benchmarks OR-QuAC and TREC CAsT 19 show that the proposed CONVERSER achieves comparable performance to fully-supervised models, demonstrating the…
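The abstract describes generating conversational queries from a corpus passage via in-context learning. The following is a minimal sketch of how such a few-shot prompt might be assembled; it is not the authors' implementation. The prompt layout ("Passage:" / "Q1:" labels), the `Dialogue` type, and the function names are all assumptions made for illustration. Only the cap of 6 demonstrations comes from the abstract.

```python
from dataclasses import dataclass
from typing import List

# The abstract states "at most 6 examples of in-domain dialogues".
MAX_SHOTS = 6


@dataclass
class Dialogue:
    """Hypothetical container: one passage and the ordered queries asked about it."""
    passage: str
    queries: List[str]


def format_dialogue(passage: str, queries: List[str]) -> str:
    """Render a passage followed by its numbered conversational queries."""
    lines = [f"Passage: {passage}"]
    lines += [f"Q{i}: {q}" for i, q in enumerate(queries, start=1)]
    return "\n".join(lines)


def build_prompt(shots: List[Dialogue], passage: str, history: List[str]) -> str:
    """Assemble a few-shot prompt that ends with an open slot for the next query.

    `shots` are the in-domain demonstrations, `passage` is sampled from the
    retrieval corpus, and `history` holds queries already generated for it.
    """
    if len(shots) > MAX_SHOTS:
        raise ValueError(f"expected at most {MAX_SHOTS} demonstrations, got {len(shots)}")
    blocks = [format_dialogue(s.passage, s.queries) for s in shots]
    # The target block stops at an unanswered "Qn:" label for the model to complete.
    target = format_dialogue(passage, history) + f"\nQ{len(history) + 1}:"
    blocks.append(target)
    return "\n\n".join(blocks)
```

Under these assumptions, the prompt would be sent to a large language model, the completion appended to `history`, and the call repeated to extend the synthetic conversation. Each generated query paired with its source passage could then serve as a training example for the dense retriever.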

Code & Models

Repositories

miulab/converser (PyTorch, official)

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications