Efficient On-Chip Communication for Parallel Graph-Analytics on Spatial Architectures
Khushal Sethi

TL;DR
This paper introduces a power-law aware graph partitioning and data mapping scheme for spatial architectures, significantly reducing data movement bottlenecks and improving execution speed and energy efficiency in large-scale graph processing.
Contribution
It proposes a novel data partitioning and mapping method tailored for spatial architectures to mitigate data movement bottlenecks in parallel graph processing.
Findings
Execution speed improved by 2-5x
Energy efficiency increased by 2.7-4x
Reduced data movement latency in graph algorithms
Abstract
Large-scale graph processing has drawn great attention in recent years. Most of the modern-day datacenter workloads can be represented in the form of Graph Processing such as MapReduce etc. Consequently, a lot of designs for Domain-Specific Accelerators have been proposed for Graph Processing. Spatial Architectures have been promising in the execution of Graph Processing, where the graph is partitioned into several nodes and each node works in parallel. We conduct experiments to analyze the on-chip movement of data in graph processing on a Spatial Architecture. Based on the observations, we identify a data movement bottleneck, in the execution of such highly parallel processing accelerators. To mitigate the bottleneck we propose a novel power-law aware Graph Partitioning and Data Mapping scheme to reduce the communication latency by minimizing the hop counts on a scalable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInterconnection Networks and Systems · Graph Theory and Algorithms · Advanced Memory and Neural Computing
