kMatrix: A Space Efficient Streaming Graph Summarization Technique
Oshan Mudannayake, Nalin Ranasinghe

TL;DR
kMatrix is a novel streaming graph summarization method that significantly reduces memory usage while maintaining low error rates, enabling efficient analysis of large-scale graph data streams.
Contribution
Introduces kMatrix, a memory-efficient streaming graph summarization technique that partitions memory based on sampled graph streams, outperforming existing methods in accuracy for the same space.
Findings
kMatrix achieves lower error rates than TCM and gMatrix.
kMatrix uses less memory than existing summarization techniques.
Experimental results validate the efficiency and accuracy of kMatrix.
Abstract
The amount of collected information on data repositories has vastly increased with the advent of the internet. It has become increasingly complex to deal with these massive data streams due to their sheer volume and the throughput of incoming data. Many of these data streams are mapped into graphs, which helps discover some of their properties. However, due to the difficulty in processing massive streaming graphs, they are summarized such that their properties can be approximately evaluated using the summaries. gSketch, TCM, and gMatrix are some of the major streaming graph summarization techniques. Our primary contribution is devising kMatrix, which is much more memory efficient than existing streaming graph summarization techniques. We achieved this by partitioning the allocated memory using a sample of the original graph stream. Through the experiments, we show that kMatrix can…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
