Machine Learning-based Selection of Graph Partitioning Strategy Using the Characteristics of Graph Data and Algorithm
YoungJoon Park, DongKyu Lee, Tien-Cuong Bui

TL;DR
This paper presents a machine learning approach to automatically select the most efficient graph partitioning strategy based on graph and algorithm features, improving distributed processing performance.
Contribution
It introduces a predictive model that estimates execution times for various strategies and selects the optimal one, utilizing both real and synthetic execution logs.
Findings
Achieves 1.46X faster execution time on average
Reduces performance to 0.95X of the best strategy
Uses features from graph data and algorithm pseudo-code
Abstract
Analyzing large graph data is an essential part of many modern applications, such as social networks. Due to its large computational complexity, distributed processing is frequently employed. This requires graph data to be divided across nodes, and the choice of partitioning strategy has a great impact on the execution time of the task. Yet, there is no one-size-fits-all partitioning strategy that performs well on arbitrary graph data and algorithms. The performance of a strategy depends on the characteristics of the graph data and algorithms. Moreover, due to the complexity of graph data and algorithms, manually identifying the best partitioning strategy is also infeasible. In this work, we propose a machine learning-based approach to select the most appropriate partitioning strategy for a given graph and processing algorithm. Our approach enumerates viable partitioning strategies,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGraph Theory and Algorithms · Advanced Graph Neural Networks · Cloud Computing and Resource Management
