Revisiting Rule-Based Stuttering Detection: A Comprehensive Analysis of Interpretable Models for Clinical Applications
Eric Zhang

TL;DR
This paper analyzes rule-based stuttering detection methods, enhancing their interpretability and performance, especially for clinical use, by integrating acoustic features and hierarchical decision structures, and compares them with neural approaches.
Contribution
It introduces an improved rule-based framework with speaking-rate normalization and hierarchical decisions, demonstrating competitive accuracy and interpretability for clinical applications.
Findings
Rule-based systems achieve 97-99% accuracy in prolongation detection.
They maintain stable performance across different speaking rates.
Rule-based models can be integrated with modern AI systems as proposal generators.
Abstract
Stuttering affects approximately 1% of the global population, impacting communication and quality of life. While recent advances in deep learning have pushed the boundaries of automatic speech dysfluency detection, rule-based approaches remain crucial for clinical applications where interpretability and transparency are paramount. This paper presents a comprehensive analysis of rule-based stuttering detection systems, synthesizing insights from multiple corpora including UCLASS, FluencyBank, and SEP-28k. We propose an enhanced rule-based framework that incorporates speaking-rate normalization, multi-level acoustic feature analysis, and hierarchical decision structures. Our approach achieves competitive performance while maintaining complete interpretability-critical for clinical adoption. We demonstrate that rule-based systems excel particularly in prolongation detection (97-99%…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
