Contrasting Deep Learning Models for Direct Respiratory Insufficiency Detection Versus Blood Oxygen Saturation Estimation
Marcelo Matheus Gauy, Natalia Hitomi Koza, Ricardo Mikio Morita,, Gabriel Rocha Stanzione, Arnaldo Candido Junior, Larissa Cristina Berti, Anna, Sara Shafferman Levin, Ester Cerdeira Sabino, Flaviane Romani Fernandes, Svartman, Marcelo Finger

TL;DR
This study compares deep learning models for respiratory insufficiency detection and blood oxygen saturation estimation from audio, finding high accuracy in classification but limited success in precise SpO2 level prediction.
Contribution
It demonstrates that pretrained audio neural networks excel at classifying respiratory conditions but are less effective for accurate SpO2 level estimation from audio data.
Findings
Deep learning models achieve near-perfect accuracy in RI detection.
Models fail to accurately regress SpO2 levels, exceeding clinical error margins.
Transforming SpO2 estimation into a binary classification reduces performance but remains limited.
Abstract
We contrast high effectiveness of state of the art deep learning architectures designed for general audio classification tasks, refined for respiratory insufficiency (RI) detection and blood oxygen saturation (SpO) estimation and classification through automated audio analysis. Recently, multiple deep learning architectures have been proposed to detect RI in COVID patients through audio analysis, achieving accuracy above 95% and F1-score above 0.93. RI is a condition associated with low SpO levels, commonly defined as the threshold SpO <92%. While SpO serves as a crucial determinant of RI, a medical doctor's diagnosis typically relies on multiple factors. These include respiratory frequency, heart rate, SpO levels, among others. Here we study pretrained audio neural networks (CNN6, CNN10 and CNN14) and the Masked Autoencoder (Audio-MAE) for RI detection, where these…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNon-Invasive Vital Sign Monitoring · Hemodynamic Monitoring and Therapy · Heart Rate Variability and Autonomic Control
