Tuberculosis Screening from Cough Audio: Baseline Models, Clinical Variables, and Uncertainty Quantification
George P. Kafentzis, Efstratios Selisios

TL;DR
This paper establishes a standardized, reproducible framework for TB detection from cough audio and clinical data, enabling fair comparison and benchmarking of models to advance research in this area.
Contribution
It introduces a comprehensive baseline pipeline for TB prediction using cough audio and clinical data, with detailed evaluation and open protocol for benchmarking.
Findings
Baseline models achieve consistent performance metrics.
Multimodal fusion improves TB detection accuracy.
Open protocol facilitates fair comparison across studies.
Abstract
In this paper, we propose a standardized framework for automatic tuberculosis (TB) detection from cough audio and routinely collected clinical data using machine learning. While TB screening from audio has attracted growing interest, progress is difficult to measure because existing studies vary substantially in datasets, cohort definitions, feature representations, model families, validation protocols, and reported metrics. Consequently, reported gains are often not directly comparable, and it remains unclear whether improvements stem from modeling advances or from differences in data and evaluation. We address this gap by establishing a strong, well-documented baseline for TB prediction using cough recordings and accompanying clinical metadata from a recently compiled dataset from several countries. Our pipeline is reproducible end-to-end, covering feature extraction, multimodal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRespiratory and Cough-Related Research · COVID-19 diagnosis using AI · Phonocardiography and Auscultation Techniques
