CO3: Low-resource Contrastive Co-training for Generative Conversational   Query Rewrite

Yifei Yuan; Chen Shi; Runze Wang; Liyi Chen; Renjun Hu; Zengming; Zhang; Feijun Jiang; Wai Lam

arXiv:2403.11873·cs.CL·March 19, 2024·1 cites

CO3: Low-resource Contrastive Co-training for Generative Conversational Query Rewrite

Yifei Yuan, Chen Shi, Runze Wang, Liyi Chen, Renjun Hu, Zengming, Zhang, Feijun Jiang, Wai Lam

PDF

Open Access

TL;DR

This paper introduces CO3, a contrastive co-training approach that leverages unlabeled data to improve low-resource conversational query rewriting, making it more robust to noise and language style shifts.

Contribution

The paper proposes a novel contrastive co-training framework with dual models and data augmentation, enhancing low-resource query rewriting performance and generalization.

Findings

01

Outperforms existing methods in few-shot and zero-shot settings.

02

Demonstrates robustness to language style shifts.

03

Shows improved generalization with unlabeled data.

Abstract

Generative query rewrite generates reconstructed query rewrites using the conversation history while rely heavily on gold rewrite pairs that are expensive to obtain. Recently, few-shot learning is gaining increasing popularity for this task, whereas these methods are sensitive to the inherent noise due to limited data size. Besides, both attempts face performance degradation when there exists language style shift between training and testing cases. To this end, we study low-resource generative conversational query rewrite that is robust to both noise and language style shift. The core idea is to utilize massive unlabeled data to make further improvements via a contrastive co-training paradigm. Specifically, we co-train two dual models (namely Rewriter and Simplifier) such that each of them provides extra guidance through pseudo-labeling for enhancing the other in an iterative manner. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems

MethodsContrastive Learning