Towards Large-scale Building Attribute Mapping using Crowdsourced Images: Scene Text Recognition on Flickr and Problems to be Solved
Yao Sun, Anna Kruspe, Liqiu Meng, Yifan Tian, Eike J Hoffmann, Stefan, Auer, Xiao Xiang Zhu

TL;DR
This paper explores the use of scene text recognition on crowdsourced Flickr images to map building attributes, highlighting challenges and proposing differentiated approaches for city-wide applications.
Contribution
It introduces a new dataset and analysis of STR effectiveness on Flickr images, addressing challenges and suggesting tailored solutions for large-scale building attribute mapping.
Findings
High accuracy of STR on selected images
Correlation between recognized texts and building functions
Identified challenges like small text regions and data mismatches
Abstract
Crowdsourced platforms provide huge amounts of street-view images that contain valuable building information. This work addresses the challenges in applying Scene Text Recognition (STR) in crowdsourced street-view images for building attribute mapping. We use Flickr images, particularly examining texts on building facades. A Berlin Flickr dataset is created, and pre-trained STR models are used for text detection and recognition. Manual checking on a subset of STR-recognized images demonstrates high accuracy. We examined the correlation between STR results and building functions, and analysed instances where texts were recognized on residential buildings but not on commercial ones. Further investigation revealed significant challenges associated with this task, including small text regions in street-view images, the absence of ground truth labels, and mismatches in buildings in Flickr…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGeographic Information Systems Studies · Automated Road and Building Extraction · Advanced Image and Video Retrieval Techniques
