World-POI: Global Point-of-Interest Data Enriched from Foursquare and OpenStreetMap as Tabular and Graph Data
Hossein Amiri, Mohammad Hashemi, Andreas Z\"ufle

TL;DR
This paper introduces a methodology to integrate Foursquare and OpenStreetMap POI datasets, creating a rich, combined dataset with enhanced metadata and a graph-based structure for advanced spatial analysis.
Contribution
It presents a novel data integration approach combining Foursquare and OSM POIs, with a scalable, reproducible process and a graph-based data representation.
Findings
Combined dataset of approximately 1 TB, with filtered releases for practical use.
High-confidence matches identified through name similarity and spatial proximity.
Enables advanced spatial analysis and downstream applications.
Abstract
Recently, Foursquare released a global dataset with more than 100 million points of interest (POIs), each representing a real-world business on its platform. However, many entries lack complete metadata such as addresses or categories, and some correspond to non-existent or fictional locations. In contrast, OpenStreetMap (OSM) offers a rich, user-contributed POI dataset with detailed and frequently updated metadata, though it does not formally verify whether a POI represents an actual business. In this data paper, we present a methodology that integrates the strengths of both datasets: Foursquare as a comprehensive baseline of commercial POIs and OSM as a source of enriched metadata. The combined dataset totals approximately 1 TB. While this full version is not publicly released, we provide filtered releases with adjustable thresholds that reduce storage needs and make the data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
