# MoTAS: MoE-Guided Feature Selection from TTS-Augmented Speech for Enhanced Multimodal Alzheimer's Early Screening

**Authors:** Yongqi Shao, Binxin Mei, Cong Tan, Hong Huo, Tao Fang

arXiv: 2508.20513 · 2025-08-29

## TL;DR

MoTAS combines Text-to-Speech (TTS) data augmentation with Mixture of Experts (MoE) feature selection to improve early Alzheimer's screening from speech in data-limited settings, reaching 85.71% accuracy on the ADReSSo dataset and outperforming existing baselines.

## Contribution

The paper presents MoTAS, a method that pairs TTS augmentation, which increases the volume of training data, with an MoE mechanism for adaptive multimodal feature selection in speech-based Alzheimer's detection.

## Key findings

- Achieves 85.71% accuracy on the ADReSSo dataset.
- Outperforms existing baselines on that dataset.
- Ablation studies validate the individual contributions of TTS augmentation and MoE to classification performance.

## Abstract

Early screening for Alzheimer's Disease (AD) through speech presents a promising non-invasive approach. However, challenges such as limited data and the lack of fine-grained, adaptive feature selection often hinder performance. To address these issues, we propose MoTAS, a robust framework designed to enhance AD screening efficiency. MoTAS leverages Text-to-Speech (TTS) augmentation to increase data volume and employs a Mixture of Experts (MoE) mechanism to improve multimodal feature selection, jointly enhancing model generalization. The process begins with automatic speech recognition (ASR) to obtain accurate transcriptions. TTS is then used to synthesize speech that enriches the dataset. After extracting acoustic and text embeddings, the MoE mechanism dynamically selects the most informative features, optimizing feature fusion for improved classification. Evaluated on the ADReSSo dataset, MoTAS achieves a leading accuracy of 85.71%, outperforming existing baselines. Ablation studies further validate the individual contributions of TTS augmentation and MoE in boosting classification performance. These findings highlight the practical value of MoTAS in real-world AD screening scenarios, particularly in data-limited settings.
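The MoE fusion step described in the abstract can be illustrated with a generic softmax-gated mixture of experts. The sketch below is not the authors' architecture: the concatenation-based fusion, the linear experts, the embedding sizes, and every name in it (`moe_fuse`, `random_layer`, and so on) are assumptions chosen for illustration, and the parameters are random rather than trained.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def linear(weights, bias, x):
    """Dense layer: one output value per row of `weights`."""
    return [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, bias)]


def random_layer(rng, n_out, n_in):
    """Hypothetical untrained parameters, used here only for illustration."""
    weights = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)] for _ in range(n_out)]
    bias = [0.0] * n_out
    return weights, bias


def moe_fuse(acoustic, text, gate, experts):
    """Fuse two embeddings with a softmax-gated mixture of linear experts."""
    # Assumption: the two modalities are fused by plain concatenation.
    fused = list(acoustic) + list(text)
    # The gate scores each expert from the fused input; softmax makes them weights.
    gate_weights = softmax(linear(gate[0], gate[1], fused))
    # Every expert produces its own view of the fused features.
    expert_outputs = [linear(w, b, fused) for w, b in experts]
    # The final representation is the gate-weighted sum of the expert outputs.
    mixed = [
        sum(g * out[i] for g, out in zip(gate_weights, expert_outputs))
        for i in range(len(expert_outputs[0]))
    ]
    return mixed, gate_weights


rng = random.Random(0)
acoustic = [rng.gauss(0, 1) for _ in range(4)]  # stand-in acoustic embedding
text = [rng.gauss(0, 1) for _ in range(3)]      # stand-in text embedding
gate = random_layer(rng, 3, 7)                  # 3 experts, 7 fused input features
experts = [random_layer(rng, 2, 7) for _ in range(3)]
mixed, gate_weights = moe_fuse(acoustic, text, gate, experts)
```

Because the gate weights depend on the input, different recordings can emphasize different experts, which is the sense in which such a mechanism selects features dynamically. In a trained model the gate and experts would be learned jointly with the classifier.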

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/2508.20513/full.md

## Figures

12 figures with captions in the complete paper: https://tomesphere.com/paper/2508.20513/full.md

## References

47 references — full list in the complete paper: https://tomesphere.com/paper/2508.20513/full.md

---
Source: https://tomesphere.com/paper/2508.20513