HiT: Building Mapping with Hierarchical Transformers
Mingming Zhang, Qingjie Liu, Yunhong Wang

TL;DR
HiT introduces a hierarchical transformer-based approach for building mapping from high-resolution remote sensing images, producing accurate vector polygons and bounding boxes in an end-to-end manner, outperforming existing methods.
Contribution
The paper presents a novel end-to-end building mapping method using Hierarchical Transformers with a polygon head that predicts serialized vertices, improving accuracy and simplicity over traditional multi-step approaches.
Findings
Achieves state-of-the-art results on CrowdAI and Inria datasets.
Outperforms existing methods in instance segmentation and polygonal metrics.
Qualitative results show robustness in complex scenes.
Abstract
Deep learning-based methods have been extensively explored for automatic building mapping from high-resolution remote sensing images over recent years. While most building mapping models produce vector polygons of buildings for geographic and mapping systems, dominant methods typically decompose polygonal building extraction in some sub-problems, including segmentation, polygonization, and regularization, leading to complex inference procedures, low accuracy, and poor generalization. In this paper, we propose a simple and novel building mapping method with Hierarchical Transformers, called HiT, improving polygonal building mapping quality from high-resolution remote sensing images. HiT builds on a two-stage detection architecture by adding a polygon head parallel to classification and bounding box regression heads. HiT simultaneously outputs building bounding boxes and vector polygons,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutomated Road and Building Extraction · Video Surveillance and Tracking Methods · Remote-Sensing Image Classification
MethodsConvolution
