COVID-19 Detection in Cough, Breath and Speech using Deep Transfer Learning and Bottleneck Features
Madhurananda Pahar, Marisa Klopper, Robin Warren, Thomas Niesler

TL;DR
This study demonstrates that deep transfer learning and bottleneck feature extraction significantly enhance the accuracy of COVID-19 detection from audio recordings of coughs, breaths, and speech, enabling effective non-contact screening on inexpensive hardware.
Contribution
The paper introduces a novel application of transfer learning and bottleneck features with deep neural networks for COVID-19 detection from audio, achieving high accuracy across multiple sound classes.
Findings
Resnet50 classifier achieved ROC AUC of 0.98 for coughs.
Transfer learning improved model generalization and stability.
Cough sounds carry the strongest COVID-19 signature.
Abstract
We present an experimental investigation into the effectiveness of transfer learning and bottleneck feature extraction in detecting COVID-19 from audio recordings of cough, breath and speech. This type of screening is non-contact, does not require specialist medical expertise or laboratory facilities and can be deployed on inexpensive consumer hardware. We use datasets that contain recordings of coughing, sneezing, speech and other noises, but do not contain COVID-19 labels, to pre-train three deep neural networks: a CNN, an LSTM and a Resnet50. These pre-trained networks are subsequently either fine-tuned using smaller datasets of coughing with COVID-19 labels in the process of transfer learning, or are used as bottleneck feature extractors. Results show that a Resnet50 classifier trained by this transfer learning process delivers optimal or near-optimal performance across all…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsTanh Activation · Support Vector Machine · Sigmoid Activation · Long Short-Term Memory
