# A study on the refined population spatialization method integrating multi-source data: A case study of Yuxi city

**Authors:** Fanghao Zhang, Feirui Jiang, Yuanshuo Zhang, Junzu Xu, Yang Ye, Zhaoliang Jia

PMC · DOI: 10.1371/journal.pone.0340430 · PLOS One · 2026-01-23

## TL;DR

This study improves population distribution mapping in Yuxi City using advanced machine learning and multi-source data, achieving higher accuracy than existing datasets.

## Contribution

A novel Stacking ensemble model integrating social sensing, remote sensing, and building data for refined population spatialization.

## Key findings

- The proposed model outperforms WorldPop and LandScan by over 40% in administrative village-scale accuracy.
- Residential building area, POI, and nighttime light intensity are key indicators of population distribution in Yuxi City.

## Abstract

The precise assessment of earthquake disasters requires high accuracy in population spatialization data. With the rapid development of technologies such as remote sensing and social sensing (social perception), and the growing availability of large-scale geographic data, high-resolution remote sensing data and vast geographical spatial data provide new opportunities for inferring more accurate population data. This paper proposes a method that integrates social perception big data, building attribute data, and multi-source remote sensing data. Using Random Forest, XGBoost, LGBM, Gradient Boosting, AdaBoost, and CatBoost models as base learners, and RidgeCV regression as the secondary learner, a higher-performing Stacking ensemble learning prediction model is constructed. The method system is employed to spatialize the population data of Yuxi City for the year 2020 at a 50m resolution using the Partition Density Mapping technique. The results show that: (1) The population estimation model proposed in this paper outperforms the WorldPop dataset and LandScan dataset by more than 40% in terms of accuracy at the administrative village scale, providing higher simulation accuracy for regions with different population densities. This result highlights the refined spatial features of the population depicted in satellite remote sensing images, offering richer and more realistic population distribution information. (2) By measuring the SHAP feature contributions and importance of the variables, it is found that residential building area, POI, and nighttime light intensity are the most significant indicators of population distribution in Yuxi City, suggesting a higher correlation of social factors in population spatial redistribution.

## Full-text entities

- **Genes:** SHROOM4 (shroom family member 4) [NCBI Gene 57477] {aka MRXSSDS, SHAP, shrm4}
- **Diseases:** GBM (MESH:D005910), POI (MESH:C000719195), LBSNs (MESH:D019292)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12829872/full.md

## Figures

15 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12829872/full.md

## References

30 references — full list in the complete paper: https://tomesphere.com/paper/PMC12829872/full.md

---
Source: https://tomesphere.com/paper/PMC12829872