Speaker-Sensitive Dual Memory Networks for Multi-Turn Slot Tagging

Young-Bum Kim; Sungjin Lee; Ruhi Sarikaya

arXiv:1711.10705·cs.CL·November 30, 2017

Speaker-Sensitive Dual Memory Networks for Multi-Turn Slot Tagging

Young-Bum Kim, Sungjin Lee, Ruhi Sarikaya

PDF

Open Access

TL;DR

This paper introduces a speaker-sensitive dual memory network architecture that improves multi-turn slot tagging by effectively encoding dialog history based on speaker roles, leading to better contextual understanding.

Contribution

It proposes a novel neural model that encodes utterances differently depending on the speaker, enhancing multi-turn slot tagging accuracy.

Findings

01

Significant performance improvement over state-of-the-art models

02

Effective encoding of speaker-specific contextual information

03

Validated on real Microsoft Cortana data

Abstract

In multi-turn dialogs, natural language understanding models can introduce obvious errors by being blind to contextual information. To incorporate dialog history, we present a neural architecture with Speaker-Sensitive Dual Memory Networks which encode utterances differently depending on the speaker. This addresses the different extents of information available to the system - the system knows only the surface form of user utterances while it has the exact semantics of system output. We performed experiments on real user data from Microsoft Cortana, a commercial personal assistant. The result showed a significant performance improvement over the state-of-the-art slot tagging models using contextual information.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Speech and dialogue systems · Natural Language Processing Techniques