Discriminant audio properties in deep learning based respiratory   insufficiency detection in Brazilian Portuguese

Marcelo Matheus Gauy; Larissa Cristina Berti; Arnaldo C\^andido Jr,; Augusto Camargo Neto; Alfredo Goldman; Anna Sara Shafferman Levin; Marcus; Martins; Beatriz Raposo de Medeiros; Marcelo Queiroz; Ester Cerdeira Sabino,; Flaviane Romani Fernandes Svartman; Marcelo Finger

arXiv:2405.17569·cs.LG·May 29, 2024

Discriminant audio properties in deep learning based respiratory insufficiency detection in Brazilian Portuguese

Marcelo Matheus Gauy, Larissa Cristina Berti, Arnaldo C\^andido Jr,, Augusto Camargo Neto, Alfredo Goldman, Anna Sara Shafferman Levin, Marcus, Martins, Beatriz Raposo de Medeiros, Marcelo Queiroz, Ester Cerdeira Sabino,, Flaviane Romani Fernandes Svartman, Marcelo Finger

PDF

TL;DR

This study explores AI-based detection of respiratory insufficiency through speech analysis, highlighting challenges in generalizing models trained on COVID-19 data to other RI causes.

Contribution

It introduces a new diverse dataset of RI patients beyond COVID-19 and evaluates the generalization limitations of existing AI models.

Findings

01

Models trained on COVID-19 RI data do not generalize well to other RI causes.

02

COVID-19 RI has distinct speech features not present in other RI types.

03

AI models require diverse training data for broader applicability.

Abstract

This work investigates Artificial Intelligence (AI) systems that detect respiratory insufficiency (RI) by analyzing speech audios, thus treating speech as a RI biomarker. Previous works collected RI data (P1) from COVID-19 patients during the first phase of the pandemic and trained modern AI models, such as CNNs and Transformers, which achieved $96.5%$ accuracy, showing the feasibility of RI detection via AI. Here, we collect RI patient data (P2) with several causes besides COVID-19, aiming at extending AI-based RI detection. We also collected control data from hospital patients without RI. We show that the considered models, when trained on P1, do not generalize to P2, indicating that COVID-19 RI has features that may not be found in all RI types.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.