StreetLens: Enabling Human-Centered AI Agents for Neighborhood Assessment from Street View Imagery
Jina Kim, Leeje Jang, Yao-Yi Chiang, Guanyu Wang, Michelle C. Pasco

TL;DR
StreetLens is a configurable AI workflow that combines vision language models with social science expertise to automate and scale neighborhood assessments from street view imagery, improving flexibility and robustness.
Contribution
It introduces a human-centered, domain-informed VLM framework for neighborhood analysis that adapts to diverse research needs and integrates prior survey data.
Findings
Automates neighborhood characteristic assessments from street view images.
Supports subjective and objective environmental annotations.
Enhances scalability and adaptability of neighborhood studies.
Abstract
Traditionally, neighborhood studies have used interviews, surveys, and manual image annotation guided by detailed protocols to identify environmental characteristics, including physical disorder, decay, street safety, and sociocultural symbols, and to examine their impact on developmental and health outcomes. Although these methods yield rich insights, they are time-consuming and require intensive expert intervention. Recent technological advances, including vision language models (VLMs), have begun to automate parts of this process; however, existing efforts are often ad hoc and lack adaptability across research designs and geographic contexts. In this paper, we present StreetLens, a user-configurable human-centered workflow that integrates relevant social science expertise into a VLM for scalable neighborhood environmental assessments. StreetLens mimics the process of trained human…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutomated Road and Building Extraction · Human Mobility and Location-Based Analysis · Video Surveillance and Tracking Methods
MethodsHigh-Order Consensuses
