Dialogue Act Tagging with Transformation-Based Learning

Ken Samuel; Sandra Carberry; and K. Vijay-Shanker (Department of; Computer; Information Sciences; University of Delaware)

arXiv:cmp-lg/9806006·cmp-lg·May 23, 2007·29 cites

Dialogue Act Tagging with Transformation-Based Learning

Ken Samuel, Sandra Carberry, and K. Vijay-Shanker (Department of, Computer, Information Sciences, University of Delaware)

PDF

Open Access

TL;DR

This paper applies Transformation-Based Learning to dialogue act tagging, introducing new features and strategies to improve efficiency and accuracy in recognizing dialogue acts.

Contribution

It introduces novel features, automatic cue construction methods, and efficiency strategies for TBL in dialogue act tagging, achieving state-of-the-art accuracy.

Findings

01

Achieves accuracy comparable to leading systems

02

Introduces effective dialogue act cues and feature extraction methods

03

Develops efficient training and confidence estimation strategies

Abstract

For the task of recognizing dialogue acts, we are applying the Transformation-Based Learning (TBL) machine learning algorithm. To circumvent a sparse data problem, we extract values of well-motivated features of utterances, such as speaker direction, punctuation marks, and a new feature, called dialogue act cues, which we find to be more effective than cue phrases and word n-grams in practice. We present strategies for constructing a set of dialogue act cues automatically by minimizing the entropy of the distribution of dialogue acts in a training corpus, filtering out irrelevant dialogue act cues, and clustering semantically-related words. In addition, to address limitations of TBL, we introduce a Monte Carlo strategy for training efficiently and a committee method for computing confidence measures. These ideas are combined in our working implementation, which labels held-out data as…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems