Efficiently Estimating Motif Statistics of Large Networks
Pinghui Wang, John C.S. Lui, Bruno Ribeiro, Don Towsley, Junzhou Zhao, and Xiaohong Guan

TL;DR
This paper introduces sampling algorithms that efficiently estimate network motif statistics in large networks with minimal queries, outperforming existing methods in accuracy and query efficiency.
Contribution
The work presents novel sampling algorithms that require no full network knowledge and provide theoretical guarantees, significantly reducing query complexity for motif estimation.
Findings
Algorithms require fewer queries than state-of-the-art methods.
The methods have theoretical convergence guarantees.
Experimental results show high accuracy with minimal sampling.
Abstract
Exploring statistics of locally connected subgraph patterns (also known as network motifs) has helped researchers better understand the structure and function of biological and online social networks (OSNs). Nowadays the massive size of some critical networks -- often stored in already overloaded relational databases -- effectively limits the rate at which nodes and edges can be explored, making it a challenge to accurately discover subgraph statistics. In this work, we propose sampling methods to accurately estimate subgraph statistics from as few queried nodes as possible. We present sampling algorithms that efficiently and accurately estimate subgraph properties of massive networks. Our algorithms require no pre-computation or complete network topology information. At the same time, we provide theoretical guarantees of convergence. We perform experiments using widely known data sets,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Bioinformatics and Genomic Networks · Advanced Graph Neural Networks
