GraphScale: Scalable Bandwidth-Efficient Graph Processing on FPGAs
Jonas Dann, Daniel Ritter, Holger Fr\"oning

TL;DR
GraphScale introduces a scalable FPGA-based framework that enhances graph processing efficiency by combining multi-channel memory, asynchronous computation, and compressed graph representations, significantly improving performance and scalability.
Contribution
It is the first framework to integrate multi-channel memory, asynchronous processing, and compressed graphs for scalable FPGA graph acceleration.
Findings
Achieves faster convergence in graph algorithms.
Reduces memory bandwidth usage significantly.
Demonstrates scalability across multiple memory channels.
Abstract
Recent advances in graph processing on FPGAs promise to alleviate performance bottlenecks with irregular memory access patterns. Such bottlenecks challenge performance for a growing number of important application areas like machine learning and data analytics. While FPGAs denote a promising solution through flexible memory hierarchies and massive parallelism, we argue that current graph processing accelerators either use the off-chip memory bandwidth inefficiently or do not scale well across memory channels. In this work, we propose GraphScale, a scalable graph processing framework for FPGAs. For the first time, GraphScale combines multi-channel memory with asynchronous graph processing (i.e., for fast convergence on results) and a compressed graph representation (i.e., for efficient usage of memory bandwidth and reduced memory footprint). GraphScale solves common graph problems like…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Graph Theory and Algorithms · Interconnection Networks and Systems
