Conversational Query Rewriting with Self-supervised Learning

Hang Liu; Meng Chen; Youzheng Wu; Xiaodong He; Bowen Zhou

arXiv:2102.04708·cs.CL·February 22, 2021

Conversational Query Rewriting with Self-supervised Learning

Hang Liu, Meng Chen, Youzheng Wu, Xiaodong He, Bowen Zhou

PDF

Open Access

TL;DR

This paper introduces a self-supervised learning approach for conversational query rewriting, creating a large dataset and a Transformer-based model that improves multi-turn dialogue understanding without extensive manual annotation.

Contribution

The paper presents a novel self-supervised dataset construction method and a Transformer-based CQR model with keyword detection and intent constraints, advancing dialogue system capabilities.

Findings

01

Outperforms existing CQR baselines significantly

02

Self-supervised learning effectively improves CQR performance

03

Demonstrates robustness on multiple datasets

Abstract

Context modeling plays a critical role in building multi-turn dialogue systems. Conversational Query Rewriting (CQR) aims to simplify the multi-turn dialogue modeling into a single-turn problem by explicitly rewriting the conversational query into a self-contained utterance. However, existing approaches rely on massive supervised training data, which is labor-intensive to annotate. And the detection of the omitted important information from context can be further improved. Besides, intent consistency constraint between contextual query and rewritten query is also ignored. To tackle these issues, we first propose to construct a large-scale CQR dataset automatically via self-supervised learning, which does not need human annotation. Then we introduce a novel CQR model Teresa based on Transformer, which is enhanced by self-attentive keywords detection and intent consistency constraint.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems

MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Attention Is All You Need · Byte Pair Encoding · Softmax · Dropout · Label Smoothing · Dense Connections · Residual Connection