Automatic Tuberculosis and COVID-19 cough classification using deep learning
Madhurananda Pahar, Marisa Klopper, Byron Reeve, Rob Warren, Grant, Theron, Andreas Diacon, Thomas Niesler

TL;DR
This paper develops a deep learning-based cough classifier that distinguishes tuberculosis and COVID-19 coughs from healthy coughs using noisy audio data, achieving high accuracy and surpassing WHO TB triage standards.
Contribution
It introduces a robust deep transfer learning approach with pre-trained CNN, LSTM, and Resnet50 models for respiratory disease cough classification using diverse, noisy audio data.
Findings
Resnet50 achieved F1-score of 0.9259 for TB vs COVID-19 classification.
Deep transfer learning improved classifier robustness and performance.
The classifier exceeds WHO TB triage test requirements.
Abstract
We present a deep learning based automatic cough classifier which can discriminate tuberculosis (TB) coughs from COVID-19 coughs and healthy coughs. Both TB and COVID-19 are respiratory diseases, contagious, have cough as a predominant symptom and claim thousands of lives each year. The cough audio recordings were collected at both indoor and outdoor settings and also uploaded using smartphones from subjects around the globe, thus containing various levels of noise. This cough data include 1.68 hours of TB coughs, 18.54 minutes of COVID-19 coughs and 1.69 hours of healthy coughs from 47 TB patients, 229 COVID-19 patients and 1498 healthy patients and were used to train and evaluate a CNN, LSTM and Resnet50. These three deep architectures were also pre-trained on 2.14 hours of sneeze, 2.91 hours of speech and 2.79 hours of noise for improved performance. The class-imbalance in our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory · Synthetic Minority Over-sampling Technique.
