GraphLab: A New Framework For Parallel Machine Learning

Yucheng Low; Joseph E. Gonzalez; Aapo Kyrola; Danny Bickson; Carlos E.; Guestrin; Joseph Hellerstein

arXiv:1408.2041·cs.LG·August 12, 2014·441 cites

GraphLab: A New Framework For Parallel Machine Learning

Yucheng Low, Joseph E. Gonzalez, Aapo Kyrola, Danny Bickson, Carlos E., Guestrin, Joseph Hellerstein

PDF

Open Access

TL;DR

GraphLab is a novel parallel framework tailored for machine learning that efficiently expresses asynchronous algorithms with sparse dependencies, enabling high-performance execution on large-scale problems.

Contribution

It introduces GraphLab, a high-level abstraction that simplifies designing parallel ML algorithms with data consistency and performance guarantees.

Findings

01

Successfully implemented parallel belief propagation, Gibbs sampling, Co-EM, Lasso, and Compressed Sensing.

02

Achieved high parallel performance on large-scale real-world datasets.

03

Demonstrated expressiveness and efficiency of GraphLab for diverse ML algorithms.

Abstract

Designing and implementing efficient, provably correct parallel machine learning (ML) algorithms is challenging. Existing high-level parallel abstractions like MapReduce are insufficiently expressive while low-level tools like MPI and Pthreads leave ML experts repeatedly solving the same design challenges. By targeting common patterns in ML, we developed GraphLab, which improves upon abstractions like MapReduce by compactly expressing asynchronous iterative algorithms with sparse computational dependencies while ensuring data consistency and achieving a high degree of parallel performance. We demonstrate the expressiveness of the GraphLab framework by designing and implementing parallel versions of belief propagation, Gibbs sampling, Co-EM, Lasso and Compressed Sensing. We show that using GraphLab we can achieve excellent parallel performance on large scale real-world problems.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Graph Neural Networks · Bayesian Modeling and Causal Inference · Error Correcting Code Techniques