Census tract-level socioeconomic variables and breast cancer characteristics and outcomes in California and New York State
Margaret Gates Kuliszewski, Baozhen Qiao, Mandi Yu, Maria J. Schymura, Tabassum Insaf

TL;DR
Living in disadvantaged areas is linked to worse breast cancer outcomes, and synthetic data can approximate real data for analysis.
Contribution
The study demonstrates that synthetic census tract data can be used to analyze cancer outcomes without compromising privacy.
Findings
Greater socioeconomic disadvantage is associated with more advanced breast cancer stages and poorer survival.
Synthetic data results for California were consistent with actual data in direction and significance.
Synthetic data overestimated associations with stage but underestimated those with grade and subtype.
Abstract
Synthetic census tracts can allow for release of small area cancer data without compromising patient confidentiality. We used synthetic and actual census tract data for California and actual data for New York State (NYS) to examine associations of small area socioeconomic factors with breast cancer prognosis and outcomes and to evaluate results obtained from synthetic versus actual data. We retrieved data on invasive, first primary breast cancers diagnosed between 2006 and 2017 in females ages ≥ 18 in California (n = 237,156) or NYS (n = 149,789). We categorized into quintiles census tract-level exposures and used multivariable-adjusted multilevel logistic and Cox proportional hazards regression analyses to examine associations with stage, grade, subtype, and overall and cancer-specific survival. We conducted separate analyses for California and NYS and compared results from the two…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGlobal Cancer Incidence and Screening · Data-Driven Disease Surveillance · Health, Environment, Cognitive Aging
