An Analysis of Human-centered Geolocation
Kaili Wang, Yu-Hui Huang, Jose Oramas, Luc Van Gool, Tinne Tuytelaars

TL;DR
This paper explores how visual cues from people's appearances and surroundings in social media images can be used to accurately determine the geographic location, demonstrating the effectiveness of convolutional neural networks over human judgment.
Contribution
It introduces an automatic method using CNNs for geolocation based on human-centered visual cues and compares its performance to human capabilities.
Findings
Automatic methods outperform humans in geolocation accuracy.
Clothing style and physical features are key indicators for location.
Contextual features like environment also contribute significantly.
Abstract
Online social networks contain a constantly increasing amount of images - most of them focusing on people. Due to cultural and climate factors, fashion trends and physical appearance of individuals differ from city to city. In this paper we investigate to what extent such cues can be exploited in order to infer the geographic location, i.e. the city, where a picture was taken. We conduct a user study, as well as an evaluation of automatic methods based on convolutional neural networks. Experiments on the Fashion 144k and a Pinterest-based dataset show that the automatic methods succeed at this task to a reasonable extent. As a matter of fact, our empirical results suggest that automatic methods can surpass human performance by a large margin. Further inspection of the trained models shows that human-centered characteristics, like clothing style, physical features, and accessories, are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Video Surveillance and Tracking Methods · Human Pose and Action Recognition
