Effective Incorporation of Speaker Information in Utterance Encoding in   Dialog

Tianyu Zhao; Tatsuya Kawahara

arXiv:1907.05599·eess.AS·July 15, 2019·6 cites

Effective Incorporation of Speaker Information in Utterance Encoding in Dialog

Tianyu Zhao, Tatsuya Kawahara

PDF

Open Access

TL;DR

This paper introduces a relative speaker modeling method that improves dialog encoding by effectively incorporating speaker information, leading to better performance in dialog act recognition and response generation.

Contribution

A novel relative speaker modeling approach that addresses inconsistencies in speaker annotations and enhances dialog encoding effectiveness.

Findings

01

Improved dialog act recognition accuracy

02

Enhanced response generation quality

03

More consistent performance across dialogs

Abstract

In dialog studies, we often encode a dialog using a hierarchical encoder where each utterance is converted into an utterance vector, and then a sequence of utterance vectors is converted into a dialog vector. Since knowing who produced which utterance is essential to understanding a dialog, conventional methods tried integrating speaker labels into utterance vectors. We found the method problematic in some cases where speaker annotations are inconsistent among different dialogs. A relative speaker modeling method is proposed to address the problem. Experimental evaluations on dialog act recognition and response generation show that the proposed method yields superior and more consistent performances.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and dialogue systems · Topic Modeling · Natural Language Processing Techniques