Dealing with Distribution Mismatch in Semi-supervised Deep Learning for Covid-19 Detection Using Chest X-ray Images: A Novel Approach Using Feature Densities
Saul Calderon-Ramirez, Shengxiang Yang, David Elizondo, Armaghan, Moemeni

TL;DR
This paper investigates the impact of distribution mismatch in semi-supervised deep learning for COVID-19 detection using chest X-ray images and proposes a simple, feature density-based method to mitigate accuracy loss caused by unlabelled data from different sources.
Contribution
It introduces a novel, computationally inexpensive approach using feature density approximation to reduce distribution mismatch effects in semi-supervised models.
Findings
Distribution mismatch causes up to 30% accuracy loss.
The proposed method improves accuracy by up to 32%.
Outperforms existing out-of-distribution detection methods.
Abstract
In the context of the global coronavirus pandemic, different deep learning solutions for infected subject detection using chest X-ray images have been proposed. However, deep learning models usually need large labelled datasets to be effective. Semi-supervised deep learning is an attractive alternative, where unlabelled data is leveraged to improve the overall model's accuracy. However, in real-world usage settings, an unlabelled dataset might present a different distribution than the labelled dataset (i.e. the labelled dataset was sampled from a target clinic and the unlabelled dataset from a source clinic). This results in a distribution mismatch between the unlabelled and labelled datasets. In this work, we assess the impact of the distribution mismatch between the labelled and the unlabelled datasets, for a semi-supervised model trained with chest X-ray images, for COVID-19…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCOVID-19 diagnosis using AI · Radiomics and Machine Learning in Medical Imaging · AI in cancer detection
