Generalizing Conversational Dense Retrieval via LLM-Cognition Data   Augmentation

Haonan Chen; Zhicheng Dou; Kelong Mao; Jiongnan Liu; Ziliang Zhao

arXiv:2402.07092·cs.CL·June 5, 2024·1 cites

Generalizing Conversational Dense Retrieval via LLM-Cognition Data Augmentation

Haonan Chen, Zhicheng Dou, Kelong Mao, Jiongnan Liu, Ziliang Zhao

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces ConvAug, a novel framework that enhances conversational dense retrieval models by generating diverse, cognition-inspired augmented data and employing contrastive learning, significantly improving generalization across various conversational scenarios.

Contribution

ConvAug is the first to incorporate multi-level data augmentation and cognition-aware filtering for conversational retrieval, addressing data sparsity and improving model robustness.

Findings

01

Improves retrieval accuracy on four public datasets.

02

Enhances zero-shot generalization capabilities.

03

Demonstrates effectiveness of cognition-inspired augmentation.

Abstract

Conversational search utilizes muli-turn natural language contexts to retrieve relevant passages. Existing conversational dense retrieval models mostly view a conversation as a fixed sequence of questions and responses, overlooking the severe data sparsity problem -- that is, users can perform a conversation in various ways, and these alternate conversations are unrecorded. Consequently, they often struggle to generalize to diverse conversations in real-world scenarios. In this work, we propose a framework for generalizing Conversational dense retrieval via LLM-cognition data Augmentation (ConvAug). ConvAug first generates multi-level augmented conversations to capture the diverse nature of conversational contexts. Inspired by human cognition, we devise a cognition-aware process to mitigate the generation of false positives, false negatives, and hallucinations. Moreover, we develop a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

haon-chen/convaug
pytorchOfficial

Videos

Generalizing Conversational Dense Retrieval via LLM-Cognition Data Augmentation· underline

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Advanced Text Analysis Techniques

MethodsContrastive Learning