Channel-aware Decoupling Network for Multi-turn Dialogue Comprehension

Zhuosheng Zhang; Hai Zhao; Longxiang Liu

arXiv:2301.03953 · cs.CL · January 12, 2023

TL;DR

This paper introduces a channel-aware decoupling network that improves multi-turn dialogue comprehension by explicitly modeling speaker roles and the interactions between utterances. It outperforms existing pre-trained language model baselines on four benchmark datasets.

Contribution

The paper proposes a decoupling mechanism for Transformer-based pre-trained language models (PrLMs) that captures hierarchical dialogue features, namely utterance interrelations and speaker roles, which purely sequential dialogue modeling does not address well.
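
As a rough illustration of what "decoupling" a flat token sequence into utterance-level and speaker-level channels could look like, the sketch below builds boolean attention masks from per-token utterance and speaker indices. This is an assumption made for illustration only: this summary does not say how the paper implements its decoupling, and the function name `build_channel_masks`, the four channel names, and the toy inputs are all hypothetical.

```python
from typing import Dict, List


def build_channel_masks(
    utterance_ids: List[int], speaker_ids: List[int]
) -> Dict[str, List[List[bool]]]:
    """Build one boolean token-to-token mask per channel (hypothetical design).

    masks[name][i][j] is True when token i may attend to token j in that channel.
    """
    n = len(utterance_ids)
    if len(speaker_ids) != n:
        raise ValueError("utterance_ids and speaker_ids must have equal length")

    names = ("same_utterance", "other_utterance", "same_speaker", "other_speaker")
    masks = {name: [[False] * n for _ in range(n)] for name in names}

    for i in range(n):
        for j in range(n):
            same_utt = utterance_ids[i] == utterance_ids[j]
            same_spk = speaker_ids[i] == speaker_ids[j]
            # Utterance channels: inside the token's own utterance vs. all others.
            masks["same_utterance"][i][j] = same_utt
            masks["other_utterance"][i][j] = not same_utt
            # Speaker channels: tokens from the same speaker vs. other speakers.
            masks["same_speaker"][i][j] = same_spk
            masks["other_speaker"][i][j] = not same_spk
    return masks


# Toy dialogue of five tokens: utterance 0 (speaker 0), utterance 1 (speaker 1),
# utterance 2 (speaker 0 again).
utterance_ids = [0, 0, 1, 1, 2]
speaker_ids = [0, 0, 1, 1, 0]
masks = build_channel_masks(utterance_ids, speaker_ids)
```

In a design like this, each mask would restrict a separate self-attention pass over the PrLM's contextualized token representations, and the per-channel outputs would then be fused. The fusion step is left out here because the summary gives no detail about it.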

Findings

01. Achieves state-of-the-art results on four benchmark datasets.

02. Substantially improves baseline PrLM performance.

03. Effectively models speaker roles and utterance dependencies.

Abstract

Training machines to understand natural language and interact with humans is one of the major goals of artificial intelligence. Recent years have witnessed an evolution from matching networks to pre-trained language models (PrLMs). In contrast to the plain text that PrLMs primarily model, dialogue texts involve multiple speakers and reflect special characteristics such as topic transitions and structure dependencies between distant utterances. However, existing PrLMs commonly represent dialogues sequentially by processing the pairwise dialogue history as a whole. Thus, the hierarchical information on utterance interrelations and speaker roles that is coupled in such representations is not well addressed. In this work, we propose compositional learning for holistic interaction across the utterances beyond the sequential contextualization from PrLMs, in order to capture the…
