Respondent-driven sampling bias induced by clustering and community   structure in social networks

Luis Enrique Correa Rocha; Anna Ekeus Thorson; Renaud Lambiotte,; Fredrik Liljeros

arXiv:1503.05826·stat.AP·February 9, 2021

Respondent-driven sampling bias induced by clustering and community structure in social networks

Luis Enrique Correa Rocha, Anna Ekeus Thorson, Renaud Lambiotte,, Fredrik Liljeros

PDF

TL;DR

This paper investigates how social network structures like community clustering and heterogeneity affect the bias and accuracy of respondent-driven sampling (RDS) in hidden populations, highlighting potential biases and limitations.

Contribution

It provides a simulation-based analysis of how network features influence RDS bias, offering insights into the method's limitations in complex social networks.

Findings

01

Community size influences prevalence estimates.

02

Low-degree nodes may be under-sampled.

03

Low response rates cause biases but maintain average estimate accuracy.

Abstract

Sampling hidden populations is particularly challenging using standard sampling methods mainly because of the lack of a sampling frame. Respondent-driven sampling (RDS) is an alternative methodology that exploits the social contacts between peers to reach and weight individuals in these hard-to-reach populations. It is a snowball sampling procedure where the weight of the respondents is adjusted for the likelihood of being sampled due to differences in the number of contacts. In RDS, the structure of the social contacts thus defines the sampling process and affects its coverage, for instance by constraining the sampling within a sub-region of the network. In this paper we study the bias induced by network structures such as social triangles, community structure, and heterogeneities in the number of contacts, in the recruitment trees and in the RDS estimator. We simulate different…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.