Investigating the Utility of Multimodal Conversational Technology and   Audiovisual Analytic Measures for the Assessment and Monitoring of   Amyotrophic Lateral Sclerosis at Scale

Michael Neumann; Oliver Roesler; Jackson Liscombe; Hardik Kothare,; David Suendermann-Oeft; David Pautler; Indu Navar; Aria Anvar; Jochen Kumm,; Raquel Norel; Ernest Fraenkel; Alexander V. Sherman; James D. Berry; Gary L.; Pattee; Jun Wang; Jordan R. Green; Vikram Ramanarayanan

arXiv:2104.07310·eess.AS·April 16, 2021

Investigating the Utility of Multimodal Conversational Technology and Audiovisual Analytic Measures for the Assessment and Monitoring of Amyotrophic Lateral Sclerosis at Scale

Michael Neumann, Oliver Roesler, Jackson Liscombe, Hardik Kothare,, David Suendermann-Oeft, David Pautler, Indu Navar, Aria Anvar, Jochen Kumm,, Raquel Norel, Ernest Fraenkel, Alexander V. Sherman, James D. Berry, Gary L., Pattee, Jun Wang, Jordan R. Green, Vikram Ramanarayanan

PDF

TL;DR

This study introduces a cloud-based multimodal platform that automatically analyzes speech to remotely assess and monitor ALS, showing significant differences in speech metrics between healthy controls and patients, and predicting disease severity.

Contribution

The paper presents a novel scalable system using audiovisual speech metrics for remote ALS assessment, demonstrating its effectiveness in differentiating patient groups and predicting disease severity.

Findings

01

Significant acoustic and visual speech differences between controls and ALS patients.

02

High sensitivity and specificity of speech metrics in classification tasks.

03

Effective prediction of ALS severity using audiovisual features.

Abstract

We propose a cloud-based multimodal dialog platform for the remote assessment and monitoring of Amyotrophic Lateral Sclerosis (ALS) at scale. This paper presents our vision, technology setup, and an initial investigation of the efficacy of the various acoustic and visual speech metrics automatically extracted by the platform. 82 healthy controls and 54 people with ALS (pALS) were instructed to interact with the platform and completed a battery of speaking tasks designed to probe the acoustic, articulatory, phonatory, and respiratory aspects of their speech. We find that multiple acoustic (rate, duration, voicing) and visual (higher order statistics of the jaw and lip) speech metrics show statistically significant differences between controls, bulbar symptomatic and bulbar pre-symptomatic patients. We report on the sensitivity and specificity of these metrics using five-fold…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsAdaptive Label Smoothing