Diagnosing Urban Street Vitality via a Visual-Semantic and Spatiotemporal Framework for Street-Level Economics
Xinxin Zhuo, Mengyuan Niu, Ruizhe Wang, Junyan Yang, Qiao Wang

TL;DR
This paper introduces a novel visual-semantic and spatiotemporal framework, the Street Economic Vitality Index (SEVI), for assessing urban street vitality through integrated imagery analysis and demand modeling.
Contribution
It develops a comprehensive diagnostic system combining physical and semantic streetscape parsing with temporal demand analysis, addressing limitations of static Street View Imagery.
Findings
Quasi-causal spatiotemporal heterogeneity in street vibrancy identified.
Brand hierarchy and mall externalities influence street vitality patterns.
Interfaces and structural recession impact attraction and repulsion effects at different times.
Abstract
Micro-scale street-level economic assessment is fundamental for precision spatial resource allocation. While Street View Imagery (SVI) advances urban sensing, existing approaches remain semantically superficial and overlook brand hierarchy heterogeneity and structural recession. To address this, we propose a visual-semantic and field-based spatiotemporal framework, operationalized via the Street Economic Vitality Index (SEVI). Our approach integrates physical and semantic streetscape parsing through instance segmentation of signboards, glass interfaces, and storefront closures. A dual-stage VLM-LLM pipeline standardizes signage into global hierarchies to quantify a spatially smoothed brand premium index. To overcome static SVI limitations, we introduce a temporal lag design using Location-Based Services (LBS) data to capture realized demand. Combined with a category-weighted Gaussian…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
