Learning Dialogue Representations from Consecutive Utterances

Zhihan Zhou; Dejiao Zhang; Wei Xiao; Nicholas Dingwall; Xiaofei Ma,; Andrew O. Arnold; Bing Xiang

arXiv:2205.13568·cs.CL·July 25, 2022

Learning Dialogue Representations from Consecutive Utterances

Zhihan Zhou, Dejiao Zhang, Wei Xiao, Nicholas Dingwall, Xiaofei Ma,, Andrew O. Arnold, Bing Xiang

PDF

Open Access 1 Repo

TL;DR

This paper introduces Dialogue Sentence Embedding (DSE), a self-supervised contrastive learning method that learns effective dialogue representations from consecutive utterances, significantly improving performance on various dialogue tasks especially in few-shot and zero-shot scenarios.

Contribution

The paper proposes DSE, a simple yet effective contrastive learning approach that leverages consecutive dialogue utterances to learn high-quality dialogue representations.

Findings

01

DSE outperforms existing dialogue representation models.

02

Achieves 13% average improvement in 1-shot intent classification.

03

Effective in few-shot and zero-shot dialogue tasks.

Abstract

Learning high-quality dialogue representations is essential for solving a variety of dialogue-oriented tasks, especially considering that dialogue systems often suffer from data scarcity. In this paper, we introduce Dialogue Sentence Embedding (DSE), a self-supervised contrastive learning method that learns effective dialogue representations suitable for a wide range of dialogue tasks. DSE learns from dialogues by taking consecutive utterances of the same dialogue as positive pairs for contrastive learning. Despite its simplicity, DSE achieves significantly better representation capability than other dialogue representation and universal sentence representation models. We evaluate DSE on five downstream dialogue tasks that examine dialogue representation at different semantic granularities. Experiments in few-shot and zero-shot settings show that DSE outperforms baselines by a large…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

amazon-research/dse
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Speech and dialogue systems · Natural Language Processing Techniques

MethodsLinear Layer · Contrastive Learning · Residual Connection · Weight Decay · Attention Dropout · Linear Warmup With Linear Decay · WordPiece · Adam · Dropout · Softmax