Performance and Energy Consumption of Parallel Machine Learning Algorithms
Xidong Wu, Preston Brazzle, Stephen Cahoon

TL;DR
This paper evaluates the performance and energy consumption of parallel machine learning algorithms, including logistic regression, genetic algorithms, and neural networks, across different parallelization methods and data sizes.
Contribution
It provides a comparative analysis of power and performance impacts of various parallelization techniques on machine learning training algorithms.
Findings
Parallel efficiency varies with model complexity and data size.
GPU acceleration significantly speeds up training.
Power consumption is influenced by parallelization method and data scale.
Abstract
Machine learning models have achieved remarkable success in various real-world applications such as data science, computer vision, and natural language processing. However, model training in machine learning requires large-scale data sets and multiple iterations before it can work properly. Parallelization of training algorithms is a common strategy to speed up the process of training. However, many studies on model training and inference focus only on aspects of performance. Power consumption is also an important metric for any type of computation, especially high-performance applications. Machine learning algorithms that can be used on low-power platforms such as sensors and mobile devices have been researched, but less power optimization is done for algorithms designed for high-performance computing. In this paper, we present a C++ implementation of logistic regression and the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Neural Networks and Applications · Advanced Neural Network Applications
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Logistic Regression
