Mask-to-Height: A YOLOv11-Based Architecture for Joint Building Instance Segmentation and Height Classification from Satellite Imagery
Mahmoud El Hussieni, Bahad{\i}r K. G\"unt\"urk, Hasan F. Ate\c{s}, O\u{g}uz Hano\u{g}lu

TL;DR
This paper introduces YOLOv11-based architecture for joint building segmentation and height classification from satellite images, demonstrating improved accuracy and speed over previous models for urban mapping tasks.
Contribution
The paper presents a novel YOLOv11-based model that effectively combines building instance segmentation with height classification in satellite imagery, enhancing performance in complex urban scenes.
Findings
Achieves 60.4% mAP@50 for building detection
Maintains robust height classification accuracy across five tiers
Outperforms earlier frameworks in detection speed and accuracy
Abstract
Accurate building instance segmentation and height classification are critical for urban planning, 3D city modeling, and infrastructure monitoring. This paper presents a detailed analysis of YOLOv11, the recent advancement in the YOLO series of deep learning models, focusing on its application to joint building extraction and discrete height classification from satellite imagery. YOLOv11 builds on the strengths of earlier YOLO models by introducing a more efficient architecture that better combines features at different scales, improves object localization accuracy, and enhances performance in complex urban scenes. Using the DFC2023 Track 2 dataset -- which includes over 125,000 annotated buildings across 12 cities -- we evaluate YOLOv11's performance using metrics such as precision, recall, F1 score, and mean average precision (mAP). Our findings demonstrate that YOLOv11 achieves…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Remote-Sensing Image Classification · Remote Sensing and LiDAR Applications
