Hadoop in Low-Power Processors
Da Zheng, Alexander Szalay, Andreas Terzis

TL;DR
This paper evaluates the performance and energy efficiency of Amdahl blades, low-power systems with Atom processors, running Hadoop, and proposes techniques to mitigate CPU bottlenecks, highlighting the need for multi-core configurations.
Contribution
It provides a detailed analysis of Hadoop performance on low-power Amdahl blades and introduces techniques to improve efficiency, emphasizing multi-core requirements.
Findings
Amdahl blades are significantly more energy-efficient than conventional clusters.
Disk and network I/O are CPU-heavy on Atom processors, limiting performance.
Three techniques effectively reduce CPU load and enhance Hadoop performance.
Abstract
In our previous work we introduced a so-called Amdahl blade microserver that combines a low-power Atom processor, with a GPU and an SSD to provide a balanced and energy-efficient system. Our preliminary results suggested that the sequential I/O of Amdahl blades can be ten times higher than that a cluster of conventional servers with comparable power consumption. In this paper we investigate the performance and energy efficiency of Amdahl blades running Hadoop. Our results show that Amdahl blades are 7.7 times and 3.4 times as energy-efficient as the Open Cloud Consortium cluster for a data-intensive and a compute-intensive application, respectively. The Hadoop Distributed Filesystem has relatively poor performance on Amdahl blades because both disk and network I/O are CPU-heavy operations on Atom processors. We demonstrate three effective techniques to reduce CPU consumption and improve…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Computing and Resource Management · Caching and Content Delivery · IoT and Edge/Fog Computing
