Semi-supervised deep learning based on label propagation in a 2D embedded space
Barbara Caroline Benato, Jancarlo Ferreira Gomes, Alexandru, Cristian Telea, Alexandre Xavier Falc\~ao

TL;DR
This paper introduces a semi-supervised deep learning method that iteratively improves label quality using feature space embedding and label propagation, reducing the need for extensive labeled datasets.
Contribution
It proposes a novel loop combining deep neural network training with label propagation in a 2D embedded space to enhance classification with minimal labeled data.
Findings
Significant accuracy improvements with only 1-5% labeled data.
Effective label propagation using t-SNE and Optimum-Path Forest.
Validated on multiple private and public datasets.
Abstract
While convolutional neural networks need large labeled sets for training images, expert human supervision of such datasets can be very laborious. Proposed solutions propagate labels from a small set of supervised images to a large set of unsupervised ones to obtain sufficient truly-and-artificially labeled samples to train a deep neural network model. Yet, such solutions need many supervised images for validation. We present a loop in which a deep neural network (VGG-16) is trained from a set with more correctly labeled samples along iterations, created by using t-SNE to project the features of its last max-pooling layer into a 2D embedded space in which labels are propagated using the Optimum-Path Forest semi-supervised classifier. As the labeled set improves along iterations, it improves the features of the neural network. We show that this can significantly improve classification…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
