Reinforcement Learning Enhanced Weighted Sampling for Accurate Subgraph Counting on Fully Dynamic Graph Streams
Kaixin Wang, Cheng Long, Da Yan, Jie Zhang, H. V. Jagadish

TL;DR
This paper introduces a reinforcement learning-based weighted sampling method for more accurate subgraph counting in fully dynamic graph streams, outperforming existing uniform sampling approaches in accuracy and speed.
Contribution
It presents a novel weighted sampling algorithm, WSD, that uses reinforcement learning to determine edge importance, improving subgraph count estimates in dynamic graphs.
Findings
WSD achieves lower estimation errors than uniform sampling methods.
The approach runs faster while maintaining accuracy.
Reinforcement learning effectively determines edge importance in dynamic graphs.
Abstract
As the popularity of graph data increases, there is a growing need to count the occurrences of subgraph patterns of interest, for a variety of applications. Many graphs are massive in scale and also fully dynamic (with insertions and deletions of edges), rendering exact computation of these counts to be infeasible. Common practice is, instead, to use a small set of edges as a sample to estimate the counts. Existing sampling algorithms for fully dynamic graphs sample the edges with uniform probability. In this paper, we show that we can do much better if we sample edges based on their individual properties. Specifically, we propose a weighted sampling algorithm called WSD for estimating the subgraph count in a fully dynamic graph stream, which samples the edges based on their weights that indicate their importance and reflect their properties. We determine the weights of edges in a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Advanced Graph Neural Networks · Age of Information Optimization
