Tuberculosis Screening from Cough Audio: Baseline Models, Clinical Variables, and Uncertainty Quantification
George P. Kafentzis, Efstratios Selisios

TL;DR
This paper introduces a standardized machine learning framework for detecting tuberculosis from cough audio and clinical data to enable fair comparisons and progress in TB screening.
Contribution
The paper provides a reproducible baseline framework for TB detection using cough audio and clinical metadata, enabling fair benchmarking across studies.
Findings
A standardized pipeline for TB detection is proposed with end-to-end reproducibility and uncertainty quantification.
Performance is evaluated for audio-only and combined audio + clinical metadata models using consistent clinical metrics.
The full experimental protocol is released to facilitate benchmarking and reduce methodological variance in the field.
Abstract
In this paper, we propose a standardized framework for automatic tuberculosis (TB) detection from cough audio and routinely collected clinical data using machine learning. While TB screening from audio has attracted growing interest, progress is difficult to measure because existing studies vary substantially in datasets, cohort definitions, feature representations, model families, validation protocols, and reported metrics. Consequently, reported gains are often not directly comparable, and it remains unclear whether improvements stem from modeling advances or from differences in data and evaluation. We address this gap by establishing a strong, well-documented baseline for TB prediction using cough recordings and accompanying clinical metadata from a recently compiled dataset from several countries. Our pipeline is reproducible end-to-end, covering feature extraction, multimodal…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRespiratory and Cough-Related Research · COVID-19 diagnosis using AI · Phonocardiography and Auscultation Techniques
