Design, Configuration, Implementation, and Performance of a Simple 32 Core Raspberry Pi Cluster
Vincent A. Cicirello

TL;DR
This paper details the design and implementation of an inexpensive 8-node, 32-core Raspberry Pi cluster, evaluating its performance on computational tasks and discussing its educational and development use cases.
Contribution
It presents a practical Raspberry Pi cluster setup and analyzes its performance limitations and potential for parallel computing applications.
Findings
Networking limits parallel performance for data-intensive tasks.
Parallel speedup degrades when utilizing all cores due to resource contention.
Using three cores per node achieves near-linear speedup.
Abstract
In this report, I describe the design and implementation of an inexpensive, eight node, 32 core, cluster of raspberry pi single board computers, as well as the performance of this cluster on two computational tasks, one that requires significant data transfer relative to computational time requirements, and one that does not. We have two use-cases for the cluster: (a) as an educational tool for classroom usage, such as covering parallel algorithms in an algorithms course; and (b) as a test system for use during the development of parallel metaheuristics, essentially serving as a personal desktop parallel computing cluster. Our preliminary results show that the slow 100 Mbps networking of the raspberry pi significantly limits such clusters to parallel computational tasks that are either long running relative to data communications requirements, or that which requires very little…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Interconnection Networks and Systems · Energy Harvesting in Wireless Networks
