Privacy Protection, Measurement Error, and the Integration of Remote Sensing and Socioeconomic Survey Data
Jeffrey D. Michler, Anna Josephson, Talip Kilic, Siobhan Murray

TL;DR
This study examines how privacy-preserving spatial anonymization in large-scale socioeconomic surveys affects measurement accuracy when integrating with remote sensing weather data, finding limited impact on estimates but highlighting the importance of weather data choice.
Contribution
It provides a systematic analysis of measurement error introduced by spatial anonymization methods in survey data integration with remote sensing weather data.
Findings
Spatial anonymization has limited impact on weather-agriculture estimates.
The choice of remote sensing weather product influences measurement error.
Careful selection of weather data is crucial for accurate analysis.
Abstract
When publishing socioeconomic survey data, survey programs implement a variety of statistical methods designed to preserve privacy but which come at the cost of distorting the data. We explore the extent to which spatial anonymization methods to preserve privacy in the large-scale surveys supported by the World Bank Living Standards Measurement Study - Integrated Surveys on Agriculture (LSMS-ISA) introduce measurement error in econometric estimates when that survey data is integrated with remote sensing weather data. Guided by a pre-analysis plan, we produce 90 linked weather-household datasets that vary by the spatial anonymization method and the remote sensing weather product. By varying the data along with the econometric model we quantify the magnitude and significance of measurement error coming from the loss of accuracy that results from protect privacy measures. We find that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpatial and Panel Data Analysis · Data-Driven Disease Surveillance · COVID-19 epidemiological studies
