Impact on Public Health Decision Making by Utilizing Big Data Without Domain Knowledge
Miao Zhang, Salman Rahman, Vishwali Mhasawade, Rumi Chunara

TL;DR
This study examines the use of big data, specifically street view imagery and health data from NYC, to inform public health decisions, highlighting challenges of data bias and robustness in AI-driven analysis.
Contribution
It demonstrates the limitations of using street view imagery for health-related built environment assessments and introduces a causal framework to evaluate intervention impacts.
Findings
Built environment features inferred from GSV may not align with ground truth.
Physical inactivity mediates the impact of environment features on health.
Targeted interventions based on robust analysis can significantly reduce obesity and diabetes prevalence.
Abstract
New data sources, and artificial intelligence (AI) methods to extract information from them are becoming plentiful, and relevant to decision making in many societal applications. An important example is street view imagery, available in over 100 countries, and considered for applications such as assessing built environment aspects in relation to community health outcomes. Relevant to such uses, important examples of bias in the use of AI are evident when decision-making based on data fails to account for the robustness of the data, or predictions are based on spurious correlations. To study this risk, we utilize 2.02 million GSV images along with health, demographic, and socioeconomic data from New York City. Initially, we demonstrate that built environment characteristics inferred from GSV labels at the intra-city level may exhibit inadequate alignment with the ground truth. We also…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBig Data and Business Intelligence · Technology and Data Analysis
