Dialogue Act Modeling for Automatic Tagging and Recognition of Conversational Speech
A. Stolcke, K. Ries, N. Coccaro, E. Shriberg, R. Bates, D. Jurafsky,, P. Taylor, R. Martin, C. Van Ess-Dykema, M. Meteer

TL;DR
This paper presents a statistical model for automatically recognizing dialogue acts in conversational speech, integrating lexical, prosodic, and discourse cues to improve speech recognition and dialogue understanding.
Contribution
It introduces a novel probabilistic framework combining dialogue act modeling with speech recognition, using a hidden Markov model and neural networks trained on a large corpus.
Findings
Dialogue act labeling accuracy reached 65% with automatic speech recognition.
The model achieved 71% accuracy using perfect transcripts.
The approach resulted in a small reduction in word recognition error.
Abstract
We describe a statistical approach for modeling dialogue acts in conversational speech, i.e., speech-act-like units such as Statement, Question, Backchannel, Agreement, Disagreement, and Apology. Our model detects and predicts dialogue acts based on lexical, collocational, and prosodic cues, as well as on the discourse coherence of the dialogue act sequence. The dialogue model is based on treating the discourse structure of a conversation as a hidden Markov model and the individual dialogue acts as observations emanating from the model states. Constraints on the likely sequence of dialogue acts are modeled via a dialogue act n-gram. The statistical dialogue grammar is combined with word n-grams, decision trees, and neural networks modeling the idiosyncratic lexical and prosodic manifestations of each dialogue act. We develop a probabilistic integration of speech recognition with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and dialogue systems · Topic Modeling · Natural Language Processing Techniques
