CityVerse: A Unified Data Platform for Multi-Task Urban Computing with Large Language Models
Yaqiao Zhu, Hongkai Wen, Mark Birkin, Man Luo

TL;DR
CityVerse is a comprehensive platform that unifies urban data and tasks to systematically evaluate large language models' capabilities in urban computing, facilitating fair comparison and reproducibility.
Contribution
It introduces the first unified platform with integrated urban data, a structured task taxonomy, and simulation tools for evaluating LLMs in urban contexts.
Findings
Effective evaluation of mainstream LLMs across diverse urban tasks.
Demonstrates the platform's ability to support reproducible assessments.
Facilitates systematic comparison of LLM capabilities in urban computing.
Abstract
Large Language Models (LLMs) show remarkable potential for urban computing, from spatial reasoning to predictive analytics. However, evaluating LLMs across diverse urban tasks faces two critical challenges: lack of unified platforms for consistent multi-source data access and fragmented task definitions that hinder fair comparison. To address these challenges, we present CityVerse, the first unified platform integrating multi-source urban data, capability-based task taxonomy, and dynamic simulation for systematic LLM evaluation in urban contexts. CityVerse provides: 1) coordinate-based Data APIs unifying ten categories of urban data-including spatial features, temporal dynamics, demographics, and multi-modal imagery-with over 38 million curated records; 2) Task APIs organizing 43 urban computing tasks into a four-level cognitive hierarchy: Perception, Spatial Understanding, Reasoning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Mobility and Location-Based Analysis · Smart Cities and Technologies · Multimodal Machine Learning Applications
