Disfluency Detection using a Bidirectional LSTM

Vicky Zayats; Mari Ostendorf; Hannaneh Hajishirzi

arXiv:1604.03209·cs.CL·April 13, 2016

Disfluency Detection using a Bidirectional LSTM

Vicky Zayats, Mari Ostendorf, Hannaneh Hajishirzi

PDF

TL;DR

This paper presents a novel disfluency detection method using a Bidirectional LSTM that incorporates pattern match features and ILP constraints, achieving state-of-the-art results on Switchboard.

Contribution

It introduces a BLSTM model with pattern match features and ILP-based constraints for improved disfluency detection performance.

Findings

01

Achieves state-of-the-art performance on Switchboard disfluency detection

02

Better detection of non-repetition disfluencies

03

Model outperforms previous methods in both detection and correction tasks

Abstract

We introduce a new approach for disfluency detection using a Bidirectional Long-Short Term Memory neural network (BLSTM). In addition to the word sequence, the model takes as input pattern match features that were developed to reduce sensitivity to vocabulary size in training, which lead to improved performance over the word sequence alone. The BLSTM takes advantage of explicit repair states in addition to the standard reparandum states. The final output leverages integer linear programming to incorporate constraints of disfluency structure. In experiments on the Switchboard corpus, the model achieves state-of-the-art performance for both the standard disfluency detection task and the correction detection task. Analysis shows that the model has better detection of non-repetition disfluencies, which tend to be much harder to detect.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.