Conformer Based Elderly Speech Recognition System for Alzheimer's Disease Detection
Tianzi Wang, Jiajun Deng, Mengzhe Geng, Zi Ye, Shoukang Hu, Yi Wang,, Mingyu Cui, Zengrui Jin, Xunying Liu, Helen Meng

TL;DR
This paper develops a Conformer-based speech recognition system for early Alzheimer's detection, achieving high accuracy by integrating advanced modeling features and adaptation techniques on elderly speech data.
Contribution
It introduces a novel combination of neural architecture search, speaker adaptation, and rescoring methods to improve elderly speech recognition for Alzheimer's detection.
Findings
13.6% absolute WER reduction achieved
Speech recognition accuracy of 91.7% for AD detection
Enhanced elderly speech recognition performance
Abstract
Early diagnosis of Alzheimer's disease (AD) is crucial in facilitating preventive care to delay further progression. This paper presents the development of a state-of-the-art Conformer based speech recognition system built on the DementiaBank Pitt corpus for automatic AD detection. The baseline Conformer system trained with speed perturbation and SpecAugment based data augmentation is significantly improved by incorporating a set of purposefully designed modeling features, including neural architecture search based auto-configuration of domain-specific Conformer hyper-parameters in addition to parameter fine-tuning; fine-grained elderly speaker adaptation using learning hidden unit contributions (LHUC); and two-pass cross-system rescoring based combination with hybrid TDNN systems. An overall word error rate (WER) reduction of 13.6% absolute (34.8% relative) was obtained on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
