Dialogue Act Sequence Labeling using Hierarchical encoder with CRF

Harshit Kumar; Arvind Agarwal; Riddhiman Dasgupta; Sachindra Joshi,; Arun Kumar

arXiv:1709.04250·cs.CL·September 15, 2017

Dialogue Act Sequence Labeling using Hierarchical encoder with CRF

Harshit Kumar, Arvind Agarwal, Riddhiman Dasgupta, Sachindra Joshi,, Arun Kumar

PDF

3 Repos 2 Datasets

TL;DR

This paper introduces a hierarchical neural network with CRF for dialogue act recognition, improving accuracy on benchmark datasets by modeling dependencies among dialogue acts and utterances.

Contribution

The work presents a novel hierarchical LSTM-CRF model that captures multi-level representations and label dependencies for dialogue act sequence labeling.

Findings

01

Achieved 2.2% and 4.1% accuracy improvements on two datasets.

02

Effectively models label and utterance dependencies in dialogues.

03

Performs well despite noisy training data.

Abstract

Dialogue Act recognition associate dialogue acts (i.e., semantic labels) to utterances in a conversation. The problem of associating semantic labels to utterances can be treated as a sequence labeling problem. In this work, we build a hierarchical recurrent neural network using bidirectional LSTM as a base unit and the conditional random field (CRF) as the top layer to classify each utterance into its corresponding dialogue act. The hierarchical network learns representations at multiple levels, i.e., word level, utterance level, and conversation level. The conversation level representations are input to the CRF layer, which takes into account not only all previous utterances but also their dialogue acts, thus modeling the dependency among both, labels and utterances, an important consideration of natural dialogue. We validate our approach on two different benchmark data sets,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory