Urban-Semantic Computer Vision: A Framework for Contextual Understanding of People in Urban Spaces
Anthony Vanky, Ri Le

TL;DR
This paper critiques current AI and computer vision applications in urban spaces for lacking contextual depth and proposes a new framework inspired by cultural interpretation to improve semantic understanding of urban environments.
Contribution
It introduces a novel framework based on interpretive theories to evaluate and enhance the contextual understanding of AI in urban spaces, addressing current limitations.
Findings
Current AI applications lack urban context and depth.
Proposes three methodologies for urban-semantic description.
Highlights conflicts in technology use among different urban stakeholders.
Abstract
Increasing computational power and improving deep learning methods have made computer vision technologies pervasively common in urban environments. Their applications in policing, traffic management, and documenting public spaces are increasingly common. Despite the often-discussed biases in the algorithms' training and unequally borne benefits, almost all applications similarly reduce urban experiences to simplistic, reductive, and mechanistic measures. There is a lack of context, depth, and specificity in these practices that enables semantic knowledge or analysis within urban contexts, especially within the context of using and occupying urban space. This paper will critique existing uses of artificial intelligence and computer vision in urban practices to propose a new framework for understanding people, action, and public space. This paper revisits Geertz's use of thick…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Mobility and Location-Based Analysis · Video Surveillance and Tracking Methods
