Enhancing Multi-Label Thoracic Disease Diagnosis with Deep Ensemble-Based Uncertainty Quantification
Yasiru Laksara, Uthayasanker Thayasivam

TL;DR
This paper enhances thoracic disease diagnosis by integrating deep ensemble-based uncertainty quantification, significantly improving model calibration, reliability, and interpretability over previous deterministic approaches.
Contribution
It introduces a robust deep ensemble framework that stabilizes performance and provides reliable uncertainty estimates for multi-label thoracic disease diagnosis.
Findings
Achieved state-of-the-art AUROC of 0.8559
Significantly reduced calibration error with mean ECE of 0.0728
Enabled decomposition of uncertainty into aleatoric and epistemic components
Abstract
The utility of deep learning models, such as CheXNet, in high stakes clinical settings is fundamentally constrained by their purely deterministic nature, failing to provide reliable measures of predictive confidence. This project addresses this critical gap by integrating robust Uncertainty Quantification (UQ) into a high performance diagnostic platform for 14 common thoracic diseases on the NIH ChestX-ray14 dataset. Initial architectural development failed to stabilize performance and calibration using Monte Carlo Dropout (MCD), yielding an unacceptable Expected Calibration Error (ECE) of 0.7588. This technical failure necessitated a rigorous architectural pivot to a high diversity, 9-member Deep Ensemble (DE). This resulting DE successfully stabilized performance and delivered superior reliability, achieving a State-of-the-Art (SOTA) average Area Under the Receiver Operating…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCOVID-19 diagnosis using AI · Machine Learning in Healthcare · Explainable Artificial Intelligence (XAI)
