Correcting Data Imbalance for Semi-Supervised Covid-19 Detection Using X-ray Chest Images
Saul Calderon-Ramirez, Shengxiang-Yang, Armaghan Moemeni, David, Elizondo, Simon Colreavy-Donnelly, Luis Fernando Chavarria-Estrada, Miguel A., Molina-Cabello

TL;DR
This paper enhances semi-supervised deep learning for Covid-19 detection in chest X-ray images by addressing data imbalance through a re-weighting strategy, significantly improving classification accuracy with limited labeled data.
Contribution
It introduces a simple re-weighting method for imbalance correction in semi-supervised learning, combined with MixMatch, to improve Covid-19 detection accuracy on small, imbalanced datasets.
Findings
Re-weighting observations improves accuracy by up to 10%.
The approach is effective across multiple datasets with limited labels.
Statistically significant improvements over non-balanced MixMatch.
Abstract
The Corona Virus (COVID-19) is an internationalpandemic that has quickly propagated throughout the world. The application of deep learning for image classification of chest X-ray images of Covid-19 patients, could become a novel pre-diagnostic detection methodology. However, deep learning architectures require large labelled datasets. This is often a limitation when the subject of research is relatively new as in the case of the virus outbreak, where dealing with small labelled datasets is a challenge. Moreover, in the context of a new highly infectious disease, the datasets are also highly imbalanced,with few observations from positive cases of the new disease. In this work we evaluate the performance of the semi-supervised deep learning architecture known as MixMatch using a very limited number of labelled observations and highly imbalanced labelled dataset. We propose a simple…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
