Confidence-Guided Data Augmentation for Improved Semi-Supervised Training
Fadoua Khmaissia, Hichem Frigui

TL;DR
This paper introduces a confidence-guided data augmentation method that leverages a VAE to generate synthetic images from challenging samples, enhancing semi-supervised image classification accuracy and robustness.
Contribution
It presents a novel approach combining confidence-based sample selection with VAE-generated synthetic data for improved semi-supervised training.
Findings
Improved classification accuracy on STL10 and CIFAR-100 datasets.
Synthetic data diversifies training data and enhances model robustness.
Outperforms baseline fully supervised models.
Abstract
We propose a new strategy to improve the accuracy and robustness of image classification. First, we train a baseline CNN model. Then, we identify challenging regions in the feature space by identifying all misclassified samples, and correctly classified samples with low confidence values. These samples are then used to train a Variational AutoEncoder (VAE). Next, the VAE is used to generate synthetic images. Finally, the generated synthetic images are used in conjunction with the original labeled images to train a new model in a semi-supervised fashion. Empirical results on benchmark datasets such as STL10 and CIFAR-100 show that the synthetically generated samples can further diversify the training data, leading to improvement in image classification in comparison with the fully supervised baseline approaches using only the available data.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Advanced Neural Network Applications · Brain Tumor Detection and Classification
