Performance and Energy Consumption of Parallel Machine Learning   Algorithms

Xidong Wu; Preston Brazzle; Stephen Cahoon

arXiv:2305.00798·cs.DC·May 2, 2023·2 cites

Performance and Energy Consumption of Parallel Machine Learning Algorithms

Xidong Wu, Preston Brazzle, Stephen Cahoon

PDF

Open Access

TL;DR

This paper evaluates the performance and energy consumption of parallel machine learning algorithms, including logistic regression, genetic algorithms, and neural networks, across different parallelization methods and data sizes.

Contribution

It provides a comparative analysis of power and performance impacts of various parallelization techniques on machine learning training algorithms.

Findings

01

Parallel efficiency varies with model complexity and data size.

02

GPU acceleration significantly speeds up training.

03

Power consumption is influenced by parallelization method and data scale.

Abstract

Machine learning models have achieved remarkable success in various real-world applications such as data science, computer vision, and natural language processing. However, model training in machine learning requires large-scale data sets and multiple iterations before it can work properly. Parallelization of training algorithms is a common strategy to speed up the process of training. However, many studies on model training and inference focus only on aspects of performance. Power consumption is also an important metric for any type of computation, especially high-performance applications. Machine learning algorithms that can be used on low-power platforms such as sensors and mobile devices have been researched, but less power optimization is done for algorithms designed for high-performance computing. In this paper, we present a C++ implementation of logistic regression and the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsParallel Computing and Optimization Techniques · Neural Networks and Applications · Advanced Neural Network Applications

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Logistic Regression