Span Classification with Structured Information for Disfluency Detection   in Spoken Utterances

Sreyan Ghosh; Sonal Kumar; Yaman Kumar Singla; Rajiv Ratn Shah; S.; Umesh

arXiv:2203.16028·cs.CL·April 19, 2022

Span Classification with Structured Information for Disfluency Detection in Spoken Utterances

Sreyan Ghosh, Sonal Kumar, Yaman Kumar Singla, Rajiv Ratn Shah, S., Umesh

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel span classification model that combines transformer-based contextual understanding with dependency tree-structured information via GCNs to improve disfluency detection in spoken language transcripts.

Contribution

It presents a new architecture integrating transformers and GCNs for disfluency detection, leveraging structured dependency information for the first time in this task.

Findings

01

Achieves state-of-the-art results on English Switchboard dataset.

02

Significantly outperforms previous methods.

03

Demonstrates effectiveness of structured information in disfluency detection.

Abstract

Existing approaches in disfluency detection focus on solving a token-level classification task for identifying and removing disfluencies in text. Moreover, most works focus on leveraging only contextual information captured by the linear sequences in text, thus ignoring the structured information in text which is efficiently captured by dependency trees. In this paper, building on the span classification paradigm of entity recognition, we propose a novel architecture for detecting disfluencies in transcripts from spoken utterances, incorporating both contextual information through transformers and long-distance structured information captured by dependency trees, through graph convolutional networks (GCNs). Experimental results show that our proposed model achieves state-of-the-art results on the widely used English Switchboard for disfluency detection and outperforms prior-art by a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sreyan88/disfluency-detection-with-span-classification
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsText Readability and Simplification · Natural Language Processing Techniques · Interpreting and Communication in Healthcare