Efficient Distributed Semi-Supervised Learning using Stochastic Regularization over Affinity Graphs
Sunil Thulasidasan, Jeffrey Bilmes, Garrett Kenyon

TL;DR
This paper introduces a scalable, stochastic graph-regularization method for semi-supervised deep learning that improves accuracy with limited labels and enables efficient distributed training.
Contribution
It presents a novel stochastic graph-based mini-batch construction technique that enhances semi-supervised DNN training in parallel or distributed environments.
Findings
Significant accuracy improvements with low labeled data
Achieves notable speed-up in distributed training
Effective for speech corpus semi-supervised learning
Abstract
We describe a computationally efficient, stochastic graph-regularization technique that can be utilized for the semi-supervised training of deep neural networks in a parallel or distributed setting. We utilize a technique, first described in [13] for the construction of mini-batches for stochastic gradient descent (SGD) based on synthesized partitions of an affinity graph that are consistent with the graph structure, but also preserve enough stochasticity for convergence of SGD to good local minima. We show how our technique allows a graph-based semi-supervised loss function to be decomposed into a sum over objectives, facilitating data parallelism for scalable training of machine learning models. Empirical results indicate that our method significantly improves classification accuracy compared to the fully-supervised case when the fraction of labeled data is low, and in the parallel…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Domain Adaptation and Few-Shot Learning · Privacy-Preserving Technologies in Data
MethodsStochastic Gradient Descent
