Deep Active Learning for Lung Disease Severity Classification from Chest X-rays: Learning with Less Data in the Presence of Class Imbalance
Roy M. Gabriel, Mohammadreza Zandehshahvar, Marly van Assen, Nattakorn Kittisut, Kyle Peters, Carlo N. De Cecco, Ali Adibi

TL;DR
This study demonstrates that deep active learning with Bayesian Neural Networks and weighted loss functions can effectively reduce labeled data needs for lung disease severity classification from chest X-rays, especially under class imbalance, while maintaining high accuracy.
Contribution
The paper introduces a novel application of deep active learning with BNNs and weighted loss to improve lung disease severity classification from CXRs with less labeled data and class imbalance.
Findings
Entropy Sampling achieved 93.7% accuracy with 15.4% of data
Mean STD sampling achieved 70.3% accuracy with 23.1% of data
Methods outperformed complex acquisition functions and reduced labeling effort
Abstract
To reduce the amount of required labeled data for lung disease severity classification from chest X-rays (CXRs) under class imbalance, this study applied deep active learning with a Bayesian Neural Network (BNN) approximation and weighted loss function. This retrospective study collected 2,319 CXRs from 963 patients (mean age, 59.2 16.6 years; 481 female) at Emory Healthcare affiliated hospitals between January and November 2020. All patients had clinically confirmed COVID-19. Each CXR was independently labeled by 3 to 6 board-certified radiologists as normal, moderate, or severe. A deep neural network with Monte Carlo Dropout was trained using active learning to classify disease severity. Various acquisition functions were used to iteratively select the most informative samples from an unlabeled pool. Performance was evaluated using accuracy, area under the receiver operating…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCOVID-19 diagnosis using AI · Phonocardiography and Auscultation Techniques · Machine Learning in Healthcare
