Neural Architecture Search with Multimodal Fusion Methods for Diagnosing   Dementia

Michail Chatzianastasis; Loukas Ilias; Dimitris Askounis; Michalis; Vazirgiannis

arXiv:2302.05894·cs.LG·April 6, 2023·1 cites

Neural Architecture Search with Multimodal Fusion Methods for Diagnosing Dementia

Michail Chatzianastasis, Loukas Ilias, Dimitris Askounis, Michalis, Vazirgiannis

PDF

Open Access

TL;DR

This paper introduces a novel approach combining neural architecture search with advanced multimodal fusion techniques to improve early dementia detection from speech and text, outperforming existing methods.

Contribution

It is the first to integrate NAS with multimodal fusion methods like Bilinear Pooling and Tucker Decomposition for dementia diagnosis from spontaneous speech.

Findings

01

Outperforms state-of-the-art methods on ADReSS dataset

02

Demonstrates the effectiveness of NAS in optimizing CNN architectures for this task

03

Shows that advanced fusion methods improve multimodal integration accuracy

Abstract

Alzheimer's dementia (AD) affects memory, thinking, and language, deteriorating person's life. An early diagnosis is very important as it enables the person to receive medical help and ensure quality of life. Therefore, leveraging spontaneous speech in conjunction with machine learning methods for recognizing AD patients has emerged into a hot topic. Most of the previous works employ Convolutional Neural Networks (CNNs), to process the input signal. However, finding a CNN architecture is a time-consuming process and requires domain expertise. Moreover, the researchers introduce early and late fusion approaches for fusing different modalities or concatenate the representations of the different modalities during training, thus the inter-modal interactions are not captured. To tackle these limitations, first we exploit a Neural Architecture Search (NAS) method to automatically find a high…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Natural Language Processing Techniques · Speech and dialogue systems

MethodsTuckER