# Cancer incidence data at the ZIP Code Tabulation Area level in the United States interpolated by Monte Carlo simulation with multiple constraints

**Authors:** Lingbo Liu, Fahui Wang, Tracy Onega

PMC · DOI: 10.1038/s41597-025-05254-8 · Scientific Data · 2025-05-30

## TL;DR

This study creates detailed cancer incidence data for small U.S. geographic areas using a simulation method, improving public health research and interventions.

## Contribution

A novel multi-constraint Monte Carlo simulation framework to reconstruct and disaggregate suppressed cancer data to ZIP Code Tabulation Areas.

## Key findings

- The method reconstructs suppressed county-level cancer data using demographic and macro-level incidence constraints.
- The dataset enables high-resolution spatial analyses and precision public health interventions at multiple geographic scales.
- The approach ensures consistency and reliability of cancer incidence data across different spatial levels.

## Abstract

High-quality cancer data are fundamental for public health research and policy, but cancer data for small geographic units and population subgroups in the United States are rarely available due to small-sample suppression rules, spatial coarsening, and data incompleteness. These limitations hinder high-resolution spatial analyses and precision public health interventions. This study provides a high-resolution cancer incidence dataset for the U.S., generated through a multi-constraint Monte Carlo simulation framework that reconstructs suppressed county-level cancer data and systematically disaggregates them to ZIP Code Tabulation Areas (ZCTAs), guided by demographic constraints. This method integrates population subgroup structures and macro-level incidence rates as constraints, ensuring consistency and reliability across spatial scales. The resulting dataset spans multiple geographic units, from state and county levels to ZCTAs, enabling comprehensive analyses of cancer burden, in-depth spatial analyses, and precision public health interventions across multiple scales.

## Linked entities

- **Diseases:** cancer (MONDO:0004992)

## Full-text entities

- **Diseases:** Cancer (MESH:D009369)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12125315/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12125315/full.md

## References

3 references — full list in the complete paper: https://tomesphere.com/paper/PMC12125315/full.md

---
Source: https://tomesphere.com/paper/PMC12125315