Discriminant audio properties in deep learning based respiratory insufficiency detection in Brazilian Portuguese
Marcelo Matheus Gauy, Larissa Cristina Berti, Arnaldo C\^andido Jr,, Augusto Camargo Neto, Alfredo Goldman, Anna Sara Shafferman Levin, Marcus, Martins, Beatriz Raposo de Medeiros, Marcelo Queiroz, Ester Cerdeira Sabino,, Flaviane Romani Fernandes Svartman, Marcelo Finger

TL;DR
This study explores AI-based detection of respiratory insufficiency through speech analysis, highlighting challenges in generalizing models trained on COVID-19 data to other RI causes.
Contribution
It introduces a new diverse dataset of RI patients beyond COVID-19 and evaluates the generalization limitations of existing AI models.
Findings
Models trained on COVID-19 RI data do not generalize well to other RI causes.
COVID-19 RI has distinct speech features not present in other RI types.
AI models require diverse training data for broader applicability.
Abstract
This work investigates Artificial Intelligence (AI) systems that detect respiratory insufficiency (RI) by analyzing speech audios, thus treating speech as a RI biomarker. Previous works collected RI data (P1) from COVID-19 patients during the first phase of the pandemic and trained modern AI models, such as CNNs and Transformers, which achieved accuracy, showing the feasibility of RI detection via AI. Here, we collect RI patient data (P2) with several causes besides COVID-19, aiming at extending AI-based RI detection. We also collected control data from hospital patients without RI. We show that the considered models, when trained on P1, do not generalize to P2, indicating that COVID-19 RI has features that may not be found in all RI types.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
