Snap ML: A Hierarchical Framework for Machine Learning
Celestine D\"unner, Thomas Parnell, Dimitrios Sarigiannis, Nikolas, Ioannou, Andreea Anghel, Gummadi Ravi, Madhusudanan Kandasamy, Haralampos, Pozidis

TL;DR
Snap ML is a hierarchical software framework that accelerates training of generalized linear models by leveraging modern computing architectures, achieving significant speedups over existing tools.
Contribution
The paper introduces a novel hierarchical framework for machine learning that combines recent system and algorithmic advances, with theoretical and practical performance benefits.
Findings
Achieves order-of-magnitude faster training on large datasets
Demonstrates effective GPU acceleration and data streaming
Outperforms TensorFlow and scikit-learn in benchmark tests
Abstract
We describe a new software framework for fast training of generalized linear models. The framework, named Snap Machine Learning (Snap ML), combines recent advances in machine learning systems and algorithms in a nested manner to reflect the hierarchical architecture of modern computing systems. We prove theoretically that such a hierarchical system can accelerate training in distributed environments where intra-node communication is cheaper than inter-node communication. Additionally, we provide a review of the implementation of Snap ML in terms of GPU acceleration, pipelining, communication patterns and software architecture, highlighting aspects that were critical for achieving high performance. We evaluate the performance of Snap ML in both single-node and multi-node environments, quantifying the benefit of the hierarchical scheme and the data streaming functionality, and comparing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Data Stream Mining Techniques · Machine Learning and ELM
MethodsLogistic Regression
