Learning Locality and Isotropy in Dialogue Modeling

Han Wu; Haochen Tan; Mingjie Zhan; Gangming Zhao; Shaoqing Lu; Ding; Liang; Linqi Song

arXiv:2205.14583·cs.CL·January 31, 2023·1 cites

Learning Locality and Isotropy in Dialogue Modeling

Han Wu, Haochen Tan, Mingjie Zhan, Gangming Zhao, Shaoqing Lu, Ding, Liang, Linqi Song

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces SimDRC, a method to improve dialogue representations by enhancing locality and isotropy, leading to better performance on multiple dialogue tasks.

Contribution

The paper proposes a simple calibration method, SimDRC, to build isotropic and conversational feature spaces, addressing anisotropy and loss of conversation structure in dialogue models.

Findings

01

Significant performance improvements on three dialogue tasks.

02

Outperforms current state-of-the-art models.

03

Validated effectiveness through automatic and human evaluations.

Abstract

Existing dialogue modeling methods have achieved promising performance on various dialogue tasks with the aid of Transformer and the large-scale pre-trained language models. However, some recent studies revealed that the context representations produced by these methods suffer the problem of anisotropy. In this paper, we find that the generated representations are also not conversational, losing the conversation structure information during the context modeling stage. To this end, we identify two properties in dialogue modeling, i.e., locality and isotropy, and present a simple method for dialogue representation calibration, namely SimDRC, to build isotropic and conversational feature spaces. Experimental results show that our approach significantly outperforms the current state-of-the-art models on three dialogue tasks across the automatic and human evaluation metrics. More in-depth…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hahahawu/simdrc
pytorchOfficial

Videos

Learning Locality and Isotropy in Dialogue Modeling· slideslive

Taxonomy

TopicsTopic Modeling · Speech and dialogue systems

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Layer Normalization · Softmax · Dense Connections · Absolute Position Encodings · Dropout · Position-Wise Feed-Forward Layer · Byte Pair Encoding