# Diagnostic accuracy of AI-assisted chest radiographs in tuberculosis screening: A Ghanaian clinical study

**Authors:** Derick Seyram Sule, Kofi Adesi Kyei, William Kwadwo Antwi, Godwill Acquah, Klenam Dzefi-Tettey, Joseph Daniels, Andrew Yaw Nyantakyi

PMC · DOI: 10.1371/journal.pone.0342988 · PLOS One · 2026-03-27

## TL;DR

An AI system was more accurate than a radiologist in detecting tuberculosis from chest X-rays in a Ghanaian study.

## Contribution

The study demonstrates AI's superior diagnostic performance over radiologists in TB screening using chest X-rays in a high-burden setting.

## Key findings

- The AI system achieved 91% accuracy, outperforming the radiologist's 86% accuracy in TB screening.
- AI showed higher agreement with GeneXpert MTB/RIF results (κ = 0.79) compared to the radiologist (κ = 0.69).
- AI's sensitivity (86%) and specificity (93%) were higher than the radiologist's (84% and 87%).

## Abstract

Tuberculosis remains a major global health challenge, particularly in resource-limited settings where access to expert radiological interpretation is constrained. Artificial intelligence offers a promising solution to enhance diagnostic accuracy and efficiency in TB screening.

This study aimed to evaluate the diagnostic performance of an AI-based system compared to a radiologist in screening for TB using chest X-rays from 1,010 patients.

Patients were adults ≥18 years with suspected TB in a high-burden setting. GeneXpert MTB/RIF served as reference to assess accuracy, sensitivity, specificity, PPV, NPV, and AUC for radiologist and AI TB predictions. Comparisons used McNemar’s test and Cohen’s kappa to evaluate agreement and significance of differences.

The AI system demonstrated superior performance with an accuracy of 91%, sensitivity of 86%, specificity of 93%, PPV of 85%, NPV of 94%, and AUC of 0.90. In contrast, the radiologist achieved an accuracy of 86%, sensitivity of 84%, specificity of 87%, PPV of 76%, NPV of 92%, and AUC of 0.86. McNemar’s test revealed a statistically significant difference between the two modalities (p = 0.0021). Cohen’s kappa indicated substantial agreement between AI and GeneXpert MTB/RIF result (κ = 0.79), moderate agreement for the radiologist and GeneXpert MTB/RIF result (κ = 0.69), and moderate agreement between radiologist and AI predictions (κ = 0.53).

The AI system outperformed the radiologist in TB screening, demonstrating higher diagnostic accuracy and agreement with GeneXpert MTB/RIF result. These findings support the integration of AI into TB screening workflows, particularly in settings with limited access to expert radiological interpretation.

## Linked entities

- **Diseases:** Tuberculosis (MONDO:0018076)

## Full-text entities

- **Diseases:** TB (MESH:D014376), AI (MESH:C538142), pulmonary abnormalities (MESH:D008171), TB (MESH:D014390), pulmonary tuberculosis (MESH:D014397)
- **Chemicals:** GeneXpert (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC13028515/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13028515/full.md

## References

29 references — full list in the complete paper: https://tomesphere.com/paper/PMC13028515/full.md

---
Source: https://tomesphere.com/paper/PMC13028515