Efficient Distributed Semi-Supervised Learning using Stochastic   Regularization over Affinity Graphs

Sunil Thulasidasan; Jeffrey Bilmes; Garrett Kenyon

arXiv:1612.04898·stat.ML·May 31, 2018

Efficient Distributed Semi-Supervised Learning using Stochastic Regularization over Affinity Graphs

Sunil Thulasidasan, Jeffrey Bilmes, Garrett Kenyon

PDF

Open Access

TL;DR

This paper introduces a scalable, stochastic graph-regularization method for semi-supervised deep learning that improves accuracy with limited labels and enables efficient distributed training.

Contribution

It presents a novel stochastic graph-based mini-batch construction technique that enhances semi-supervised DNN training in parallel or distributed environments.

Findings

01

Significant accuracy improvements with low labeled data

02

Achieves notable speed-up in distributed training

03

Effective for speech corpus semi-supervised learning

Abstract

We describe a computationally efficient, stochastic graph-regularization technique that can be utilized for the semi-supervised training of deep neural networks in a parallel or distributed setting. We utilize a technique, first described in [13] for the construction of mini-batches for stochastic gradient descent (SGD) based on synthesized partitions of an affinity graph that are consistent with the graph structure, but also preserve enough stochasticity for convergence of SGD to good local minima. We show how our technique allows a graph-based semi-supervised loss function to be decomposed into a sum over objectives, facilitating data parallelism for scalable training of machine learning models. Empirical results indicate that our method significantly improves classification accuracy compared to the fully-supervised case when the fraction of labeled data is low, and in the parallel…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Domain Adaptation and Few-Shot Learning · Privacy-Preserving Technologies in Data

MethodsStochastic Gradient Descent