Diffusion-Aware Sampling and Estimation in Information Diffusion Networks
Motahareh Eslami Mehdiabadi, Hamid R. Rabiee, Mostafa Salehi

TL;DR
This paper introduces a novel two-step measurement framework for information diffusion networks that leverages diffusion process characteristics to improve sampling and estimation accuracy.
Contribution
It proposes a diffusion-aware sampling and estimation framework with new estimators, the first comprehensive approach for diffusion network measurement.
Findings
Framework outperforms BFS and RW sampling methods by around 35-37% in link-based characteristics.
Utilizes infection times as local information without needing latent network structure.
Introduces three estimators to correct bias in sampled diffusion data.
Abstract
Partially-observed data collected by sampling methods is often being studied to obtain the characteristics of information diffusion networks. However, these methods usually do not consider the behavior of diffusion process. In this paper, we propose a novel two-step (sampling/estimation) measurement framework by utilizing the diffusion process characteristics. To this end, we propose a link-tracing based sampling design which uses the infection times as local information without any knowledge about the latent structure of diffusion network. To correct the bias of sampled data, we introduce three estimators for different categories; link-based, node-based, and cascade-based. To the best of our knowledge, this is the first attempt to introduce a complete measurement framework for diffusion networks. We also show that the estimator plays an important role in correcting the bias of sampling…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
