Characterizing Branching Processes from Sampled Data
Fabricio Murai, Bruno Ribeiro, Don Towsley, Krista Gile

TL;DR
This paper introduces two methods to estimate offspring distributions in branching processes from limited sampled data, enabling accurate modeling of population dynamics with as little as 14% sampling.
Contribution
The authors propose novel techniques for estimating offspring distributions from sampled data, reducing the need for extensive data collection in modeling branching processes.
Findings
Accurate offspring distribution estimates are achievable with only 14% of the population sampled.
The methods outperform traditional guesses in modeling population evolution.
Sampling with ancestor identity information improves estimation accuracy.
Abstract
Branching processes model the evolution of populations of agents that randomly generate offsprings. These processes, more patently Galton-Watson processes, are widely used to model biological, social, cognitive, and technological phenomena, such as the diffusion of ideas, knowledge, chain letters, viruses, and the evolution of humans through their Y-chromosome DNA or mitochondrial RNA. A practical challenge of modeling real phenomena using a Galton-Watson process is the offspring distribution, which must be measured from the population. In most cases, however, directly measuring the offspring distribution is unrealistic due to lack of resources or the death of agents. So far, researchers have relied on informed guesses to guide their choice of offspring distribution. In this work we propose two methods to estimate the offspring distribution from real sampled data. Using a small sampled…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic processes and statistical mechanics · Evolution and Genetic Dynamics · Complex Network Analysis Techniques
