Analyzing Geospatial and Socioeconomic Disparities in Breast Cancer Screening Among Populations in the United States: Machine Learning Approach
Soheil Hashtarkhani, Yiwang Zhou, Fekede Asefa Kumsa, Shelley, White-Means, David L Schwartz, Arash Shaban-Nejad

TL;DR
This study uses machine learning and spatial analysis to identify disparities in breast cancer screening across US populations, highlighting social determinants influencing screening rates.
Contribution
It introduces a comprehensive dataset and applies advanced machine learning techniques to analyze social and spatial factors affecting screening disparities.
Findings
Higher screening rates in eastern and northern US regions.
Random forest outperformed other models in predicting screening rates.
Key influential factors include racial demographics, facility access, and education levels.
Abstract
Breast cancer screening plays a pivotal role in early detection and subsequent effective management of the disease, impacting patient outcomes and survival rates. This study aims to assess breast cancer screening rates nationwide in the United States and investigate the impact of social determinants of health on these screening rates. Data on mammography screening at the census tract level for 2018 and 2020 were collected from the Behavioral Risk Factor Surveillance System. We developed a large dataset of social determinants of health, comprising 13 variables for 72337 census tracts. Spatial analysis employing Getis-Ord Gi statistics was used to identify clusters of high and low breast cancer screening rates. To evaluate the influence of these social determinants, we implemented a random forest model, with the aim of comparing its performance to linear regression and support vector…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsLinear Regression
