Automatic Cough Classification for Tuberculosis Screening in a Real-World Environment
Madhurananda Pahar, Marisa Klopper, Byron Reeve, Grant Theron, Rob, Warren, Thomas Niesler

TL;DR
This study develops a machine learning-based system that classifies cough sounds to effectively screen for tuberculosis, meeting WHO standards and offering a low-cost, deployable solution for high-burden developing countries.
Contribution
The paper introduces a novel cough sound classification approach using machine learning that achieves high accuracy and meets WHO TB screening criteria in real-world settings.
Findings
Best classifier is logistic regression with feature selection.
Achieved AUC of 0.94 with 23 features from MFCCs.
System exceeds WHO sensitivity and specificity requirements.
Abstract
Objective: The automatic discrimination between the coughing sounds produced by patients with tuberculosis (TB) and those produced by patients with other lung ailments. Approach: We present experiments based on a dataset of 1358 forced cough recordings obtained in a developing-world clinic from 16 patients with confirmed active pulmonary TB and 35 patients suffering from respiratory conditions suggestive of TB but confirmed to be TB negative. Using nested cross-validation, we have trained and evaluated five machine learning classifiers: logistic regression (LR), support vector machines (SVM), k-nearest neighbour (KNN), multilayer perceptrons (MLP) and convolutional neural networks (CNN). Main Results: Although classification is possible in all cases, the best performance is achieved using LR. In combination with feature selection by sequential forward selection (SFS), our best LR…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsFeature Selection · Logistic Regression
