Investigating the Utility of Multimodal Conversational Technology and Audiovisual Analytic Measures for the Assessment and Monitoring of Amyotrophic Lateral Sclerosis at Scale
Michael Neumann, Oliver Roesler, Jackson Liscombe, Hardik Kothare,, David Suendermann-Oeft, David Pautler, Indu Navar, Aria Anvar, Jochen Kumm,, Raquel Norel, Ernest Fraenkel, Alexander V. Sherman, James D. Berry, Gary L., Pattee, Jun Wang, Jordan R. Green, Vikram Ramanarayanan

TL;DR
This study introduces a cloud-based multimodal platform that automatically analyzes speech to remotely assess and monitor ALS, showing significant differences in speech metrics between healthy controls and patients, and predicting disease severity.
Contribution
The paper presents a novel scalable system using audiovisual speech metrics for remote ALS assessment, demonstrating its effectiveness in differentiating patient groups and predicting disease severity.
Findings
Significant acoustic and visual speech differences between controls and ALS patients.
High sensitivity and specificity of speech metrics in classification tasks.
Effective prediction of ALS severity using audiovisual features.
Abstract
We propose a cloud-based multimodal dialog platform for the remote assessment and monitoring of Amyotrophic Lateral Sclerosis (ALS) at scale. This paper presents our vision, technology setup, and an initial investigation of the efficacy of the various acoustic and visual speech metrics automatically extracted by the platform. 82 healthy controls and 54 people with ALS (pALS) were instructed to interact with the platform and completed a battery of speaking tasks designed to probe the acoustic, articulatory, phonatory, and respiratory aspects of their speech. We find that multiple acoustic (rate, duration, voicing) and visual (higher order statistics of the jaw and lip) speech metrics show statistically significant differences between controls, bulbar symptomatic and bulbar pre-symptomatic patients. We report on the sensitivity and specificity of these metrics using five-fold…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsAdaptive Label Smoothing
