# Tuberculosis Screening from Cough Audio: Baseline Models, Clinical Variables, and Uncertainty Quantification

**Authors:** George P. Kafentzis, Efstratios Selisios

PMC · DOI: 10.3390/s26041223 · 2026-02-13

## TL;DR

This paper introduces a standardized machine learning framework for detecting tuberculosis from cough audio and clinical data to enable fair comparisons and progress in TB screening.

## Contribution

The paper provides a reproducible baseline framework for TB detection using cough audio and clinical metadata, enabling fair benchmarking across studies.

## Key findings

- A standardized pipeline for TB detection is proposed with end-to-end reproducibility and uncertainty quantification.
- Performance is evaluated for audio-only and combined audio + clinical metadata models using consistent clinical metrics.
- The full experimental protocol is released to facilitate benchmarking and reduce methodological variance in the field.

## Abstract

In this paper, we propose a standardized framework for automatic tuberculosis (TB) detection from cough audio and routinely collected clinical data using machine learning. While TB screening from audio has attracted growing interest, progress is difficult to measure because existing studies vary substantially in datasets, cohort definitions, feature representations, model families, validation protocols, and reported metrics. Consequently, reported gains are often not directly comparable, and it remains unclear whether improvements stem from modeling advances or from differences in data and evaluation. We address this gap by establishing a strong, well-documented baseline for TB prediction using cough recordings and accompanying clinical metadata from a recently compiled dataset from several countries. Our pipeline is reproducible end-to-end, covering feature extraction, multimodal fusion, cougher-independent evaluation, and uncertainty quantification, and it reports a consistent suite of clinically relevant metrics to enable fair comparison. We further quantify performance for cough audio-only and fused (audio + clinical metadata) models, and release the full experimental protocol to facilitate benchmarking. This baseline is intended to serve as a common reference point and to reduce methodological variance that currently holds back progress in the field.

## Linked entities

- **Diseases:** tuberculosis (MONDO:0018076)

## Full-text entities

- **Diseases:** TB (MESH:D014376), overweight (MESH:D050177), fatigue (MESH:D005221), COPD (MESH:D029424), chest pain (MESH:D002637), fever (MESH:D005334), HIV/AIDS (MESH:D015658), Parkinson's disease (MESH:D010300), respiratory diseases (MESH:D012140), deaths (MESH:D003643), injury to (MESH:D014947), AB (MESH:D001249), weight loss (MESH:D015431), CODA (MESH:D003371), COVID-19 (MESH:D000086382), pulmonary conditions (MESH:D008171), infected (MESH:D007239), pulmonary TB (MESH:D014397)
- **Chemicals:** CatBoost (-)
- **Species:** Homo sapiens (human, species) [taxon 9606], Mycobacterium tuberculosis (species) [taxon 1773]

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12944474/full.md

---
Source: https://tomesphere.com/paper/PMC12944474