A Vertex Cut based Framework for Load Balancing and Parallelism Optimization in Multi-core Systems
Guixiang Ma, Yao Xiao, Theodore L. Willke, Nesreen K. Ahmed, Shahin, Nazarian, Paul Bogdan

TL;DR
This paper introduces a vertex cut framework for partitioning LLVM IR graphs to optimize load balancing and parallelism in multi-core systems, significantly enhancing performance for complex applications.
Contribution
It presents a novel vertex cut-based partitioning framework with greedy algorithms and a memory-centric runtime mapping, improving scalability and efficiency.
Findings
Achieves up to 1.86x performance improvement over state-of-the-art methods.
Develops a flexible framework adaptable to various cluster configurations.
Demonstrates effectiveness on multi-core platforms with different cluster sizes.
Abstract
High-level applications, such as machine learning, are evolving from simple models based on multilayer perceptrons for simple image recognition to much deeper and more complex neural networks for self-driving vehicle control systems.The rapid increase in the consumption of memory and computational resources by these models demands the use of multi-core parallel systems to scale the execution of the complex emerging applications that depend on them. However, parallel programs running on high-performance computers often suffer from data communication bottlenecks, limited memory bandwidth, and synchronization overhead due to irregular critical sections. In this paper, we propose a framework to reduce the data communication and improve the scalability and performance of these applications in multi-core systems. We design a vertex cut framework for partitioning LLVM IR graphs into clusters…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGraph Theory and Algorithms · Parallel Computing and Optimization Techniques · Cloud Computing and Resource Management
