Conversational Query Rewriting with Self-supervised Learning
Hang Liu, Meng Chen, Youzheng Wu, Xiaodong He, Bowen Zhou

TL;DR
This paper introduces a self-supervised learning approach for conversational query rewriting, creating a large dataset and a Transformer-based model that improves multi-turn dialogue understanding without extensive manual annotation.
Contribution
The paper presents a novel self-supervised dataset construction method and a Transformer-based CQR model with keyword detection and intent constraints, advancing dialogue system capabilities.
Findings
Outperforms existing CQR baselines significantly
Self-supervised learning effectively improves CQR performance
Demonstrates robustness on multiple datasets
Abstract
Context modeling plays a critical role in building multi-turn dialogue systems. Conversational Query Rewriting (CQR) aims to simplify the multi-turn dialogue modeling into a single-turn problem by explicitly rewriting the conversational query into a self-contained utterance. However, existing approaches rely on massive supervised training data, which is labor-intensive to annotate. And the detection of the omitted important information from context can be further improved. Besides, intent consistency constraint between contextual query and rewritten query is also ignored. To tackle these issues, we first propose to construct a large-scale CQR dataset automatically via self-supervised learning, which does not need human annotation. Then we introduce a novel CQR model Teresa based on Transformer, which is enhanced by self-attentive keywords detection and intent consistency constraint.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems
MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Attention Is All You Need · Byte Pair Encoding · Softmax · Dropout · Label Smoothing · Dense Connections · Residual Connection
