# 4D+ City Sidewalk: Integrating Pedestrian View into Sidewalk Spaces to Support User-Centric Urban Spatial Perception

**Authors:** Jinjing Zhao, Yunfan Chen, Yancheng Li, Haotian Xu, Jingjing Xu, Xuliang Li, Hong Zhang, Lei Jin, Shengyong Xu

PMC · DOI: 10.3390/s25051375 · 2025-02-24

## TL;DR

This paper presents a system that combines user and CCTV views to create a 4D+ visualization of sidewalk environments for better urban spatial understanding.

## Contribution

A scalable spatial visualization system integrating first-person and CCTV data for user-centric sidewalk monitoring.

## Key findings

- Landmark-based localization achieves 0.468 m planar and 0.120 m height error on average.
- CCTV-assisted target localization maintains an overall error of 0.24 m.
- The system supports real-time spatiotemporal visualization of sidewalk environments.

## Abstract

As urban environments become increasingly interconnected, the demand for precise and efficient pedestrian solutions in digitalized smart cities has grown significantly. This study introduces a scalable spatial visualization system designed to enhance interactions between individuals and the street in outdoor sidewalk environments. The system operates in two main phases: the spatial prior phase and the target localization phase. In the spatial prior phase, the system captures the user’s perspective using first-person visual data and leverages landmark elements within the sidewalk environment to localize the user’s camera. In the target localization phase, the system detects surrounding objects, such as pedestrians or cyclists, using high-angle closed-circuit television (CCTV) cameras. The system was deployed in a real-world sidewalk environment at an intersection on a university campus. By combining user location data with CCTV observations, a 4D+ virtual monitoring system was developed to present a spatiotemporal visualization of the mobile participants within the user’s surrounding sidewalk space. Experimental results show that the landmark-based localization method achieves a planar positioning error of 0.468 m and a height error of 0.120 m on average. With the assistance of CCTV cameras, the localization of other targets maintains an overall error of 0.24 m. This system establishes the spatial relationship between pedestrians and the street by integrating detailed sidewalk views, with promising applications for pedestrian navigation and the potential to enhance pedestrian-friendly urban ecosystems.

## Full-text entities

- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

15 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11902832/full.md

---
Source: https://tomesphere.com/paper/PMC11902832