COVID-19 Cough Classification using Machine Learning and Global Smartphone Recordings
Madhurananda Pahar, Marisa Klopper, Robin Warren, Thomas Niesler

TL;DR
This study develops a machine learning-based system to classify COVID-19 from cough recordings on smartphones, demonstrating high accuracy across diverse datasets and offering a non-contact screening tool.
Contribution
It introduces a novel, widely applicable machine learning approach for COVID-19 cough classification using smartphone recordings, with comprehensive evaluation on diverse datasets.
Findings
Resnet50 achieved an AUC of 0.98 in distinguishing COVID-19 positive coughs from healthy ones.
LSTM classifier achieved an AUC of 0.94 in differentiating COVID-19 positive from negative coughs.
COVID-19 coughs are 15-20% shorter than non-COVID coughs.
Abstract
We present a machine learning based COVID-19 cough classifier which can discriminate COVID-19 positive coughs from both COVID-19 negative and healthy coughs recorded on a smartphone. This type of screening is non-contact, easy to apply, and can reduce the workload in testing centres as well as limit transmission by recommending early self-isolation to those who have a cough suggestive of COVID-19. The datasets used in this study include subjects from all six continents and contain both forced and natural coughs, indicating that the approach is widely applicable. The publicly available Coswara dataset contains 92 COVID-19 positive and 1079 healthy subjects, while the second smaller dataset was collected mostly in South Africa and contains 18 COVID-19 positive and 26 COVID-19 negative subjects who have undergone a SARS-CoV laboratory test. Both datasets indicate that COVID-19 positive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSupport Vector Machine · Tanh Activation · Sigmoid Activation · Long Short-Term Memory · Logistic Regression
