Generic Dependency Modeling for Multi-Party Conversation

Weizhou Shen; Xiaojun Quan; Ke Yang

arXiv:2302.10680 · cs.CL · February 22, 2023

TL;DR

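This paper introduces a generic framework for modeling the dependencies between utterances in multi-party conversations. It encodes the dependency parsing results of utterances as relative dependency encoding (ReDE) inside Transformer self-attention, which improves the performance of Transformer-based language models.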

Contribution

The paper proposes relative dependency encoding (ReDE), a dependency encoding method implemented in Transformers by modifying the computation of self-attention, to better model the dependencies between utterances in multi-party conversations.

Findings

01

Boosts the performance of two Transformer-based language models on four multi-party conversation benchmarks

02

Achieves comparable or superior results to state-of-the-art methods

03

Demonstrates effectiveness of dependency encoding in conversation modeling

Abstract

To model the dependencies between utterances in multi-party conversations, we propose a simple and generic framework based on the dependency parsing results of utterances. In particular, we present an approach to encoding the dependencies in the form of relative dependency encoding (ReDE) and illustrate how to implement it in Transformers by modifying the computation of self-attention. Experimental results on four multi-party conversation benchmarks show that this framework successfully boosts the general performance of two Transformer-based language models and leads to comparable or even superior performance compared to the state-of-the-art methods. The code is available at https://github.com/shenwzh3/ReDE.
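
The abstract does not spell out how ReDE changes self-attention, so the following is only an illustrative sketch of the general idea, not the authors' implementation (see the repository for that). It assumes that each utterance has a single parent from the dependency parse, that the "relative dependency" between two utterances is their clipped distance in the resulting tree, and that this distance selects a learned scalar bias added to the attention logits. The names `dependency_distance`, `attention_with_dependency_bias`, `parents`, `utt_ids`, and `bias_table` are all hypothetical.

```python
import numpy as np


def dependency_distance(parents, max_dist=4):
    """Clipped pairwise tree distance between utterances (an assumed encoding).

    parents[i] is the index of the utterance that utterance i depends on,
    or -1 for a root. The input is assumed to be acyclic.
    """
    n = len(parents)

    def ancestor_depths(i):
        # Map each ancestor of i (including i itself) to its distance from i.
        depths, d = {}, 0
        while i != -1:
            depths[i] = d
            i, d = parents[i], d + 1
        return depths

    chains = [ancestor_depths(i) for i in range(n)]
    # Pairs with no common ancestor keep the maximum distance.
    dist = np.full((n, n), max_dist, dtype=int)
    for i in range(n):
        for j in range(n):
            shared = chains[i].keys() & chains[j].keys()
            if shared:
                d = min(chains[i][a] + chains[j][a] for a in shared)
                dist[i, j] = min(d, max_dist)
    return dist


def attention_with_dependency_bias(x, utt_ids, dist, bias_table, w_q, w_k, w_v):
    """Single-head self-attention with an additive dependency bias (assumed form).

    x: (T, d) token representations; utt_ids: (T,) utterance index per token;
    dist: (U, U) utterance distances; bias_table: (max_dist + 1,) scalar biases.
    """
    q, k, v = x @ w_q, x @ w_k, x @ w_v
    scores = q @ k.T / np.sqrt(q.shape[-1])
    # Look up the utterance-level distance for every token pair.
    rel = dist[utt_ids[:, None], utt_ids[None, :]]
    scores = scores + bias_table[rel]
    # Numerically stable softmax over the key dimension.
    scores = scores - scores.max(axis=-1, keepdims=True)
    weights = np.exp(scores)
    weights = weights / weights.sum(axis=-1, keepdims=True)
    return weights @ v, weights


# Toy conversation: utterances 1 and 2 reply to 0, utterance 3 replies to 1.
rng = np.random.default_rng(0)
parents = [-1, 0, 0, 1]
utt_ids = np.array([0, 0, 1, 2, 3, 3])  # six tokens spread over four utterances
d_model = 8
x = rng.normal(size=(len(utt_ids), d_model))
w_q, w_k, w_v = (rng.normal(size=(d_model, d_model)) for _ in range(3))
bias_table = rng.normal(size=5)  # one bias per distance 0..4; learned in practice
out, weights = attention_with_dependency_bias(
    x, utt_ids, dependency_distance(parents), bias_table, w_q, w_k, w_v
)
```

A scalar bias per distance is the simplest way to make attention depend on dependency structure; a real model could instead use per-head biases or relation embeddings that interact with the queries and keys.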

Code & Models

Repositories

shenwzh3/rede (PyTorch, official)

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Speech and dialogue systems