Typical vs. Atypical Disfluency Classification: Introducing the   IIITH-TISA Corpus and Temporal Context-Based Feature Representations

Priyanka Kommagouni; Vamshiraghusimha Narasinga; Purva Barche; Sai; Akarsh C; Anil Vuppala

arXiv:2411.17149·eess.AS·November 27, 2024·ICASSP

Typical vs. Atypical Disfluency Classification: Introducing the IIITH-TISA Corpus and Temporal Context-Based Feature Representations

Priyanka Kommagouni, Vamshiraghusimha Narasinga, Purva Barche, Sai, Akarsh C, Anil Vuppala

PDF

Open Access

TL;DR

This paper introduces the IIITH-TISA corpus for atypical disfluencies, proposes a novel feature set with a TDNN classifier, and achieves high accuracy in classifying typical versus atypical speech disfluencies.

Contribution

The study presents the first Indian English stammer corpus and a new feature extraction method for improved disfluency classification.

Findings

01

Achieved an average F1 score of 85.01% in disfluency classification.

02

Introduced the IIITH-TISA dataset for atypical disfluencies in Indian English.

03

Demonstrated the effectiveness of PE-ZTWCC and SDC features with TDNN.

Abstract

Speech disfluencies in spontaneous communication can be categorized as either typical or atypical. Typical disfluencies, such as hesitations and repetitions, are natural occurrences in everyday speech, while atypical disfluencies are indicative of pathological disorders like stuttering. Distinguishing between these categories is crucial for improving voice assistants (VAs) for Persons Who Stutter (PWS), who often face premature cutoffs due to misidentification of speech termination. Accurate classification also aids in detecting stuttering early in children, preventing misdiagnosis as language development disfluency. This research introduces the IIITH-TISA dataset, the first Indian English stammer corpus, capturing atypical disfluencies. Additionally, we extend the IIITH-IED dataset with detailed annotations for typical disfluencies. We propose Perceptually Enhanced Zero-Time Windowed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques