Kaleido: An Efficient Out-of-core Graph Mining System on A Single Machine
Cheng Zhao, Zhibin Zhang, Peng Xu, Tianqi Zheng, Xueqi Cheng

TL;DR
Kaleido is a single-machine, out-of-core graph mining system that efficiently handles large intermediate data by treating disks as memory extensions, outperforming existing systems significantly.
Contribution
Kaleido introduces a novel out-of-core system that uses tensor representations and eigenvalue-based methods for efficient subgraph isomorphism detection on a single machine.
Findings
Kaleido outperforms Arabesque and RStream by 12.3× and 40.0× respectively.
Uses a tensor-based approach for intermediate data management.
Employs eigenvalue techniques to solve subgraph isomorphism efficiently.
Abstract
Graph mining is one of the most important categories of graph algorithms. However, exploring the subgraphs of an input graph produces a huge amount of intermediate data. The 'think like a vertex' programming paradigm, pioneered by Pregel, cannot readily formulate mining problems, which is designed to produce graph computation problems like PageRank. Existing mining systems like Arabesque and RStream need large amounts of computing and memory resources. In this paper, we present Kaleido, an efficient single machine, out-of-core graph mining system which treats disks as an extension of memory. Kaleido treats intermediate data in graph mining tasks as a tensor and adopts a succinct data structure for the intermediate data. Kaleido utilizes the eigenvalue of the adjacency matrix of a subgraph to efficiently solve the subgraph isomorphism problems with an acceptable constraint…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGraph Theory and Algorithms · Advanced Graph Neural Networks · Data Management and Algorithms
