Identifying Discourse Markers in Spoken Dialog

Peter A. Heeman (Oregon Graduate Institute), Donna Byron (U. of Rochester), James F. Allen (U. of Rochester)

arXiv:cmp-lg/9801002 · cmp-lg · May 23, 2007 · 33 cites

TL;DR

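This paper presents a machine learning method that identifies discourse markers in spontaneous speech through POS tagging. Because the tagger is folded into the language model, markers can be identified during speech recognition, and the authors argue that they should also provide evidence for dialog act prediction.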

Contribution

It proposes a novel approach integrating POS tagging into language modeling for discourse marker detection in speech recognition systems.

Findings

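01. Discourse markers are denoted by special POS tags, so POS tagging can identify them during speech recognition.

02. Discourse markers help the hearer predict the role of the upcoming utterance, so they should provide valuable evidence for automatic dialog act prediction.

03. The approach is contrasted with the alternative machine learning approach proposed by Litman (1996).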

Abstract

In this paper, we present a method for identifying discourse marker usage in spontaneous speech based on machine learning. Discourse markers are denoted by special POS tags, and thus the process of POS tagging can be used to identify discourse markers. By incorporating POS tagging into language modeling, discourse markers can be identified during speech recognition, in which the timeliness of the information can be used to help predict the following words. We contrast this approach with an alternative machine learning approach proposed by Litman (1996). This paper also argues that discourse markers can be used to help the hearer predict the role that the upcoming utterance plays in the dialog. Thus discourse markers should provide valuable evidence for automatic dialog act prediction.
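
The core idea in the abstract, that a discourse-marker usage is just another POS tag and so falls out of ordinary tagging, can be illustrated with a small sketch. This is not the authors' model: it substitutes a plain bigram HMM tagger decoded with Viterbi, and the tag names (`DM`, `RB`, `PRP`, `VB`), the vocabulary, and every probability below are made-up assumptions chosen only to show how context separates "so" as a discourse marker from "so" as an adverb.

```python
# Toy bigram HMM tagger. All tags, words, and probabilities are
# hypothetical illustration values, not figures from the paper.
TAGS = ["DM", "RB", "PRP", "VB"]

# P(tag | previous tag); "<s>" is the start-of-utterance state.
TRANS = {
    "<s>": {"DM": 0.5, "PRP": 0.4, "RB": 0.05, "VB": 0.05},
    "DM":  {"PRP": 0.8, "VB": 0.1, "RB": 0.05, "DM": 0.05},
    "PRP": {"VB": 0.9, "RB": 0.05, "DM": 0.03, "PRP": 0.02},
    "VB":  {"RB": 0.6, "PRP": 0.2, "DM": 0.1, "VB": 0.1},
    "RB":  {"RB": 0.4, "VB": 0.3, "PRP": 0.2, "DM": 0.1},
}

# P(word | tag); "so" and "well" are ambiguous between DM and RB.
EMIT = {
    "DM":  {"so": 0.5, "well": 0.5},
    "RB":  {"so": 0.4, "well": 0.3, "far": 0.3},
    "PRP": {"we": 1.0},
    "VB":  {"go": 1.0},
}


def viterbi(words):
    """Return the most probable tag sequence for a non-empty word list."""
    # best[t][tag] = (probability of the best path ending in tag, backpointer)
    best = [{
        tag: (TRANS["<s>"].get(tag, 0.0) * EMIT[tag].get(words[0], 0.0), None)
        for tag in TAGS
    }]
    for t in range(1, len(words)):
        column = {}
        for tag in TAGS:
            # Pick the predecessor tag that maximizes the path probability.
            prev, score = max(
                ((q, best[t - 1][q][0] * TRANS[q].get(tag, 0.0)) for q in TAGS),
                key=lambda pair: pair[1],
            )
            column[tag] = (score * EMIT[tag].get(words[t], 0.0), prev)
        best.append(column)

    # Follow backpointers from the best final tag.
    tag = max(TAGS, key=lambda q: best[-1][q][0])
    path = [tag]
    for t in range(len(words) - 1, 0, -1):
        tag = best[t][tag][1]
        path.append(tag)
    return path[::-1]


def discourse_markers(words):
    """Return the words that the tagger labels as discourse markers."""
    return [w for w, tag in zip(words, viterbi(words)) if tag == "DM"]


print(viterbi(["so", "we", "go"]))          # ['DM', 'PRP', 'VB']
print(viterbi(["we", "go", "so", "far"]))   # ['PRP', 'VB', 'RB', 'RB']
print(discourse_markers(["so", "we", "go"]))  # ['so']
```

In this toy setup the utterance-initial "so" is tagged `DM` while the "so" in "so far" is tagged `RB`, purely from the tag context. A tagger embedded in a recognizer's language model, as the abstract describes, would make the same kind of decision incrementally while also using the tags to help predict upcoming words.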

Taxonomy

Topics: Speech and dialogue systems · Natural Language Processing Techniques · Topic Modeling