TL;DR
This paper introduces novel random walk-based algorithms for estimating social network properties that effectively handle private nodes, significantly improving accuracy over existing methods.
Contribution
The paper proposes new algorithms that incorporate neighbor selection and weighting strategies to accurately estimate network properties despite private nodes.
Findings
Algorithms improve estimation accuracy by up to 92.6% on real datasets.
Proposed methods effectively mitigate bias caused by private nodes.
New weighting techniques reduce sampling bias and estimation errors.
Abstract
Accurately analyzing graph properties of social networks is a challenging task because of access limitations to the graph data. To address this challenge, several algorithms to obtain unbiased estimates of properties from few samples via a random walk have been studied. However, existing algorithms do not consider private nodes who hide their neighbors in real social networks, leading to some practical problems. Here we design random walk-based algorithms to accurately estimate properties without any problems caused by private nodes. First, we design a random walk-based sampling algorithm that comprises the neighbor selection to obtain samples having the Markov property and the calculation of weights for each sample to correct the sampling bias. Further, for two graph property estimators, we propose the weighting methods to reduce not only the sampling bias but also estimation errors…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
