Advancing Stuttering Detection via Data Augmentation, Class-Balanced   Loss and Multi-Contextual Deep Learning

Shakeel A. Sheikh; Md Sahidullah; Fabrice Hirsch; Slim Ouni

arXiv:2302.11343·cs.SD·February 23, 2023·1 cites

Advancing Stuttering Detection via Data Augmentation, Class-Balanced Loss and Multi-Contextual Deep Learning

Shakeel A. Sheikh, Md Sahidullah, Fabrice Hirsch, Slim Ouni

PDF

Open Access

TL;DR

This paper enhances stuttering detection by combining data augmentation, class-balanced loss, and multi-contextual deep learning, significantly improving detection accuracy especially in imbalanced and data-scarce scenarios.

Contribution

It introduces a multi-branched scheme with class-weighted loss, data augmentation, and multi-contextual modeling to improve stuttering detection performance.

Findings

01

Data augmentation improves F1 by 4.18%.

02

Multi-contextual StutterNet improves F1 by 4.48%.

03

Cross-corpora augmentation boosts F1 by 13.23%.

Abstract

Stuttering is a neuro-developmental speech impairment characterized by uncontrolled utterances (interjections) and core behaviors (blocks, repetitions, and prolongations), and is caused by the failure of speech sensorimotors. Due to its complex nature, stuttering detection (SD) is a difficult task. If detected at an early stage, it could facilitate speech therapists to observe and rectify the speech patterns of persons who stutter (PWS). The stuttered speech of PWS is usually available in limited amounts and is highly imbalanced. To this end, we address the class imbalance problem in the SD domain via a multibranching (MB) scheme and by weighting the contribution of classes in the overall loss function, resulting in a huge improvement in stuttering classes on the SEP-28k dataset over the baseline (StutterNet). To tackle data scarcity, we investigate the effectiveness of data…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStuttering Research and Treatment · Phonetics and Phonology Research · Employee Welfare and Language Studies