Extracting the U.S. building types from OpenStreetMap data
Henrique F. de Arruda, Sandro M. Reia, Shiyang Ruan, Kuldip S. Atwal,, Hamdi Kavak, Taylor Anderson, Dieter Pfoser

TL;DR
This paper presents a comprehensive U.S. building type dataset derived from OpenStreetMap data, using an unsupervised classification method validated against authoritative sources, aiding urban planning and related fields.
Contribution
It introduces a novel large-scale classification of U.S. buildings into residential and non-residential types using unsupervised learning on OSM data, covering over 67 million buildings.
Findings
High precision for non-residential building classification
High recall for residential buildings
Identified data quality issues due to missing metadata
Abstract
Building type information is crucial for population estimation, traffic planning, urban planning, and emergency response applications. Although essential, such data is often not readily available. To alleviate this problem, this work creates a comprehensive dataset by providing residential/non-residential building classification covering the entire United States. We propose and utilize an unsupervised machine learning method to classify building types based on building footprints and available OpenStreetMap information. The classification result is validated using authoritative ground truth data for select counties in the U.S. The validation shows a high precision for non-residential building classification and a high recall for residential buildings. We identified various approaches to improving the quality of the classification, such as removing sheds and garages from the dataset.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGeographic Information Systems Studies · Automated Road and Building Extraction · Remote Sensing and Land Use
