An Efficient Load Balancing Method for Tree Algorithms
Osama Talaat Ibrahim, Ahmed El-Mahdy

TL;DR
This paper introduces a hybrid load balancing technique for tree algorithms that uses statistical sampling to efficiently distribute work across multiple processors, improving scalability on parallel systems.
Contribution
It presents a novel hybrid load balancing method leveraging statistical sampling to handle irregular tree structures in parallel algorithms.
Findings
Scalable performance up to 60 processors on Intel Xeon Phi.
Effective load balancing on both regular and irregular trees.
Potential scalability extrapolated to 128 processors.
Abstract
Nowadays, multiprocessing is mainstream with exponentially increasing number of processors. Load balancing is, therefore, a critical operation for the efficient execution of parallel algorithms. In this paper we consider the fundamental class of tree-based algorithms that are notoriously irregular, and hard to load-balance with existing static techniques. We propose a hybrid load balancing method using the utility of statistical random sampling in estimating the tree depth and node count distributions to uniformly partition an input tree. To conduct an initial performance study, we implemented the method on an Intel Xeon Phi accelerator system. We considered the tree traversal operation on both regular and irregular unbalanced trees manifested by Fibonacci and unbalanced (biased) randomly generated trees, respectively. The results show scalable performance for up to the 60 physical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
