GraphTango: A Hybrid Representation Format for Efficient Streaming Graph Updates and Analysis
Alif Ahmed, Farzana Ahmed Siddique, Kevin Skadron

TL;DR
GraphTango introduces a hybrid graph representation that adapts to vertex degree, significantly improving streaming graph update and analysis throughput across diverse graph types.
Contribution
It proposes a novel hybrid format that dynamically switches among three representations based on vertex degree, optimizing performance for streaming graph processing.
Findings
4.5x higher insertion throughput
3.2x higher deletion throughput
1.1x higher analytics throughput
Abstract
Streaming graph processing involves performing updates and analytics on a time-evolving graph. The underlying representation format largely determines the throughputs of these updates and analytics phases. Existing formats usually employ variations of hash tables or adjacency lists. However, adjacency-list-based approaches perform poorly on heavy-tailed graphs, and the hash-based approaches suffer on short-tailed graphs. We propose GraphTango, a hybrid format that provides excellent update and analytics throughput regardless of the graph's degree distribution. GraphTango switches among three different formats based on a vertex's degree: i) Low-degree vertices store the edges directly with the neighborhood metadata, confining accesses to a single cache line, ii) Medium-degree vertices use adjacency lists, and iii) High-degree vertices use hash tables as well as adjacency lists. In this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGraph Theory and Algorithms · Cloud Computing and Resource Management · Caching and Content Delivery
