Modeling Subjective Urban Perception with Human Gaze
Lin Che, Xi Wang, Marc Pollefeys, Konrad Schindler, Martin Raubal, Peter Kiefer

TL;DR
This paper introduces a new dataset and framework for modeling subjective urban perception by incorporating human gaze data, demonstrating that gaze signals enhance urban scene understanding.
Contribution
It presents Place Pulse-Gaze, a novel dataset with eye-tracking data, and a Gaze-Guided Urban Perception Framework that improves perception modeling by integrating gaze with visual scene representations.
Findings
Gaze alone provides useful signals for urban perception prediction.
Combining gaze with scene representations improves prediction accuracy.
Gaze-guided models outperform gaze-only models in subjective urban perception tasks.
Abstract
Urban perception describes how people subjectively evaluate urban environments, shaping how cities are experienced and understood. Existing computational approaches primarily model urban perception directly from street view images, but largely ignore the human perceptual process through which such judgments are formed. In this paper, we introduce Place Pulse-Gaze, an urban perception dataset that augments street view images with synchronized eye-tracking recordings and individual perception labels. Based on this dataset, we propose a Gaze-Guided Urban Perception Framework to study how gaze behavior contributes to the modeling of subjective urban perception. The framework systematically investigates three complementary settings: gaze-only modeling, gaze fusion with explicit semantic scene representations, and gaze fusion with implicit richer visual representations. Experiments show that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
