Revisiting the Design of In-Memory Dynamic Graph Storage
Jixian Su, Chiyu Hao, Shixuan Sun, Hao Zhang, Sen Gao, Jiaxin Jiang,, Yao Chen, Chenyi Zhang, Bingsheng He, Minyi Guo

TL;DR
This paper systematically evaluates in-memory dynamic graph storage methods, revealing significant space overhead and performance issues, and provides insights for future improvements in real-time graph analytics.
Contribution
It introduces a common abstraction and test framework to compare DGS techniques, highlighting their trade-offs and bottlenecks.
Findings
Existing DGS methods have 3.3-10.8x more memory overhead than CSR.
Memory access patterns significantly impact performance on modern architectures.
Fine-grained concurrency control suffers from efficiency, space issues, and contention.
Abstract
The effectiveness of in-memory dynamic graph storage (DGS) for supporting concurrent graph read and write queries is crucial for real-time graph analytics and updates. Various methods have been proposed, for example, LLAMA, Aspen, LiveGraph, Teseo, and Sortledton. These approaches differ significantly in their support for read and write operations, space overhead, and concurrency control. However, there has been no systematic study to explore the trade-offs among these dimensions. In this paper, we evaluate the effectiveness of individual techniques and identify the performance factors affecting these storage methods by proposing a common abstraction for DGS design and implementing a generic test framework based on this abstraction. Our findings highlight several key insights: 1) Existing DGS methods exhibit substantial space overhead. For example, Aspen consumes 3.3-10.8x more memory…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGraph Theory and Algorithms · Cloud Computing and Resource Management · Parallel Computing and Optimization Techniques
