Interpreting glottal flow dynamics for detecting COVID-19 from voice

Soham Deshmukh; Mahmoud Al Ismail; Rita Singh

arXiv:2010.16318·eess.AS·June 15, 2021

Interpreting glottal flow dynamics for detecting COVID-19 from voice

Soham Deshmukh, Mahmoud Al Ismail, Rita Singh

PDF

TL;DR

This paper introduces a CNN-based method analyzing differences in inferred and modeled glottal flow waveforms to detect COVID-19 from voice, revealing respiratory impairments through voice production anomalies.

Contribution

It proposes a novel approach combining physical modeling and deep learning to identify COVID-19 related voice anomalies, which has not been explored before.

Findings

01

Effective detection of COVID-19 from voice using the proposed method

02

Identification of significant voice features linked to respiratory impairment

03

Demonstrated viability on a clinical dataset

Abstract

In the pathogenesis of COVID-19, impairment of respiratory functions is often one of the key symptoms. Studies show that in these cases, voice production is also adversely affected -- vocal fold oscillations are asynchronous, asymmetrical and more restricted during phonation. This paper proposes a method that analyzes the differential dynamics of the glottal flow waveform (GFW) during voice production to identify features in them that are most significant for the detection of COVID-19 from voice. Since it is hard to measure this directly in COVID-19 patients, we infer it from recorded speech signals and compare it to the GFW computed from physical model of phonation. For normal voices, the difference between the two should be minimal, since physical models are constructed to explain phonation under assumptions of normalcy. Greater differences implicate anomalies in the bio-physical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.