Breaking Data Efficiency Dilemma: A Federated and Augmented Learning Framework For Alzheimer's Disease Detection via Speech

Xiao Wei; Bin Wen; Yuqin Lin; Kai Li; Mingyang Gu; Xiaobao Wang; Longbiao Wang; Jianwu Dang

arXiv:2602.14655·cs.CL·February 17, 2026

TL;DR

This paper introduces FAL-AD, a framework that combines federated learning with data augmentation to improve early Alzheimer's detection from speech under data scarcity and privacy constraints.

Contribution

The paper presents a novel framework that integrates voice conversion-based augmentation, adaptive federated learning, and cross-modal fusion for efficient Alzheimer's detection from speech data.

Findings

01. Achieved 91.52% accuracy on the ADReSSo dataset, surpassing centralized methods.

02. Demonstrated significant efficiency improvements in data utilization.

03. Validated the framework's effectiveness in privacy-preserving multi-institutional settings.

Abstract

Early diagnosis of Alzheimer's Disease (AD) is crucial for delaying its progression. While AI-based speech detection is non-invasive and cost-effective, it faces a critical data efficiency dilemma due to medical data scarcity and privacy barriers. Therefore, we propose FAL-AD, a novel framework that synergistically integrates federated learning with data augmentation to systematically optimize data efficiency. Our approach delivers three key breakthroughs: First, absolute efficiency improvement through voice conversion-based augmentation, which generates diverse pathological speech samples via cross-category voice-content recombination. Second, collaborative efficiency breakthrough via an adaptive federated learning paradigm, maximizing cross-institutional benefits under privacy constraints. Finally, representational efficiency optimization by an attentive cross-modal fusion model,…
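To make the federated component concrete, here is a minimal sketch of plain federated averaging (FedAvg), the standard baseline in which each institution trains locally and shares only model parameters, never raw speech recordings. This is not the paper's adaptive scheme, whose details are not given in the text above; the function name `fedavg`, the parameter layout, and the two example sites are all hypothetical.

```python
from typing import Dict, List

# Hypothetical layout: parameter name -> flat list of floats.
Weights = Dict[str, List[float]]


def fedavg(client_weights: List[Weights], client_sizes: List[int]) -> Weights:
    """Average client parameters, weighting each client by its sample count.

    This is plain FedAvg, used here as an illustrative baseline only.
    """
    if not client_weights or len(client_weights) != len(client_sizes):
        raise ValueError("need one sample count per client")
    total = sum(client_sizes)
    if total <= 0:
        raise ValueError("total sample count must be positive")

    merged: Weights = {}
    for name, reference in client_weights[0].items():
        merged[name] = [
            sum(w[name][i] * n for w, n in zip(client_weights, client_sizes)) / total
            for i in range(len(reference))
        ]
    return merged


# Two hypothetical institutions with 10 and 30 local recordings.
site_a = {"layer": [1.0, 2.0]}
site_b = {"layer": [3.0, 6.0]}
global_weights = fedavg([site_a, site_b], [10, 30])
print(global_weights)  # {'layer': [2.5, 5.0]}
```

Weighting by sample count means the larger site pulls the global model further toward its local solution. An adaptive paradigm such as the one the abstract describes would presumably replace these fixed weights with something learned or data-dependent, but that is an assumption rather than something the text confirms.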

Taxonomy

Topics: Voice and Speech Disorders · Speech Recognition and Synthesis · COVID-19 diagnosis using AI