Beyond Segmentation: Structurally Informed Facade Parsing from Imperfect Images

Maciej Janicki; Aleksander Plocharski; Przemyslaw Musialski

arXiv:2604.09260·cs.CV·April 13, 2026

Beyond Segmentation: Structurally Informed Facade Parsing from Imperfect Images

Maciej Janicki, Aleksander Plocharski, Przemyslaw Musialski

PDF

TL;DR

This paper enhances facade parsing by integrating a lightweight alignment loss into YOLOv8, improving structural regularity and geometric coherence in architectural element detection from imperfect images.

Contribution

It introduces a novel alignment loss to YOLOv8 training that enforces grid consistency, improving structural coherence without changing the inference process.

Findings

01

Improved structural regularity in facade parsing results.

02

Corrected alignment errors due to perspective and occlusion.

03

Maintained detection accuracy while enhancing geometric coherence.

Abstract

Standard object detectors typically treat architectural elements independently, often resulting in facade parsings that lack the structural coherence required for downstream procedural reconstruction. We address this limitation by augmenting the YOLOv8 training objective with a custom lightweight alignment loss. This regularization encourages grid-consistent arrangements of bounding boxes during training, effectively injecting geometric priors without altering the standard inference pipeline. Experiments on the CMP dataset demonstrate that our method successfully improves structural regularity, correcting alignment errors caused by perspective and occlusion while maintaining a controllable trade-off with standard detection accuracy.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.