GeoSEE: Regional Socio-Economic Estimation With a Large Language Model
Sungwon Han, Donghyun Ahn, Seungeon Lee, Minhyuk Song, Sungwon Park,, Sangyoon Park, Jihee Kim, Meeyoung Cha

TL;DR
GeoSEE leverages a large language model to integrate diverse data sources for accurate, cost-effective socio-economic estimation across countries, especially benefiting data-scarce regions and supporting global development goals.
Contribution
The paper introduces GeoSEE, a novel LLM-based framework that intelligently selects data modules and estimates socio-economic indicators across countries, outperforming existing models in low-data settings.
Findings
Outperforms other models in various countries and data conditions.
Effective in low-resource and under-developed country contexts.
Supports monitoring of Sustainable Development Goals.
Abstract
Moving beyond traditional surveys, combining heterogeneous data sources with AI-driven inference models brings new opportunities to measure socio-economic conditions, such as poverty and population, over expansive geographic areas. The current research presents GeoSEE, a method that can estimate various socio-economic indicators using a unified pipeline powered by a large language model (LLM). Presented with a diverse set of information modules, including those pre-constructed from satellite imagery, GeoSEE selects which modules to use in estimation, for each indicator and country. This selection is guided by the LLM's prior socio-geographic knowledge, which functions similarly to the insights of a domain expert. The system then computes target indicators via in-context learning after aggregating results from selected modules in the format of natural language-based texts. Comprehensive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Mobility and Location-Based Analysis
