Multiple seed structure and disconnected networks in respondent-driven sampling
Jens Malmros, Luis E.C. Rocha

TL;DR
This paper introduces a new estimator for respondent-driven sampling that accounts for multiple seeds and disconnected networks, improving accuracy over previous methods especially with many seeds.
Contribution
The paper presents a novel estimator based on a random walk with teleportation that handles multiple seeds and disconnected social networks in RDS.
Findings
Estimator outperforms previous methods in simulations.
Effective for large seed proportions and disconnected networks.
Recommended for use in complex RDS studies.
Abstract
Respondent-driven sampling (RDS) is a link-tracing sampling method that is especially suitable for sampling hidden populations. RDS combines an efficient snowball-type sampling scheme with inferential procedures that yield unbiased population estimates under some assumptions about the sampling procedure and population structure. Several seed individuals are typically used to initiate RDS recruitment. However, standard RDS estimation theory assume that all sampled individuals originate from only one seed. We present an estimator, based on a random walk with teleportation, which accounts for the multiple seed structure of RDS. The new estimator can also be used on populations with disconnected social networks. We numerically evaluate our estimator by simulations on artificial and real networks. Our estimator outperforms previous estimators, especially when the proportion of seeds in the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHIV, Drug Use, Sexual Risk · HIV/AIDS Research and Interventions · Homelessness and Social Issues
