Breaking Data Efficiency Dilemma: A Federated and Augmented Learning Framework For Alzheimer's Disease Detection via Speech
Xiao Wei, Bin Wen, Yuqin Lin, Kai Li, Mingyang gu, Xiaobao Wang, Longbiao Wang, Jianwu Dang

TL;DR
This paper introduces FAL-AD, a framework combining federated learning and data augmentation to improve early Alzheimer's detection from speech, addressing data scarcity and privacy issues effectively.
Contribution
The paper presents a novel framework that integrates voice conversion-based augmentation, adaptive federated learning, and cross-modal fusion for efficient Alzheimer's detection from speech data.
Findings
Achieved 91.52% accuracy on ADReSSo dataset, surpassing centralized methods.
Demonstrated significant efficiency improvements in data utilization.
Validated the framework's effectiveness in privacy-preserving multi-institutional settings.
Abstract
Early diagnosis of Alzheimer's Disease (AD) is crucial for delaying its progression. While AI-based speech detection is non-invasive and cost-effective, it faces a critical data efficiency dilemma due to medical data scarcity and privacy barriers. Therefore, we propose FAL-AD, a novel framework that synergistically integrates federated learning with data augmentation to systematically optimize data efficiency. Our approach delivers three key breakthroughs: First, absolute efficiency improvement through voice conversion-based augmentation, which generates diverse pathological speech samples via cross-category voice-content recombination. Second, collaborative efficiency breakthrough via an adaptive federated learning paradigm, maximizing cross-institutional benefits under privacy constraints. Finally, representational efficiency optimization by an attentive cross-modal fusion model,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVoice and Speech Disorders · Speech Recognition and Synthesis · COVID-19 diagnosis using AI
