Learning Dialogue Representations from Consecutive Utterances
Zhihan Zhou, Dejiao Zhang, Wei Xiao, Nicholas Dingwall, Xiaofei Ma,, Andrew O. Arnold, Bing Xiang

TL;DR
This paper introduces Dialogue Sentence Embedding (DSE), a self-supervised contrastive learning method that learns effective dialogue representations from consecutive utterances, significantly improving performance on various dialogue tasks especially in few-shot and zero-shot scenarios.
Contribution
The paper proposes DSE, a simple yet effective contrastive learning approach that leverages consecutive dialogue utterances to learn high-quality dialogue representations.
Findings
DSE outperforms existing dialogue representation models.
Achieves 13% average improvement in 1-shot intent classification.
Effective in few-shot and zero-shot dialogue tasks.
Abstract
Learning high-quality dialogue representations is essential for solving a variety of dialogue-oriented tasks, especially considering that dialogue systems often suffer from data scarcity. In this paper, we introduce Dialogue Sentence Embedding (DSE), a self-supervised contrastive learning method that learns effective dialogue representations suitable for a wide range of dialogue tasks. DSE learns from dialogues by taking consecutive utterances of the same dialogue as positive pairs for contrastive learning. Despite its simplicity, DSE achieves significantly better representation capability than other dialogue representation and universal sentence representation models. We evaluate DSE on five downstream dialogue tasks that examine dialogue representation at different semantic granularities. Experiments in few-shot and zero-shot settings show that DSE outperforms baselines by a large…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Speech and dialogue systems · Natural Language Processing Techniques
MethodsLinear Layer · Contrastive Learning · Residual Connection · Weight Decay · Attention Dropout · Linear Warmup With Linear Decay · WordPiece · Adam · Dropout · Softmax
