Using an online sample to learn about an offline population
Dennis M. Feehan, Curtiss Cobb

TL;DR
This paper demonstrates that online sampling combined with social network insights can effectively estimate offline population characteristics, such as internet access, across multiple countries.
Contribution
It introduces a novel method leveraging online samples and social networks to infer offline population traits, validated through large-scale empirical testing.
Findings
Successfully estimated internet adoption in five countries
Validated the approach with a large sample of approximately 15,000 participants
Provided insights for designing future demographic studies using online data
Abstract
Online data sources offer tremendous promise to demography and other social sciences, but researchers worry that the group of people who are represented in online datasets can be different from the general population. We show that by sampling and anonymously interviewing people who are online, researchers can learn about both people who are online and people who are offline. Our approach is based on the insight that people everywhere are connected through in-person social networks, such as kin, friendship, and contact networks. We illustrate how this insight can be used to derive an estimator for tracking the *digital divide* in access to the internet, an increasingly important dimension of population inequality in the modern world. We conducted a large-scale empirical test of our approach, using an online sample to estimate internet adoption in five countries (). Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsICT Impact and Policies · Human Mobility and Location-Based Analysis · Social Media and Politics
