RoIPoly: Vectorized Building Outline Extraction Using Vertex and Logit Embeddings
Weiqin Jiao, Hao Cheng, Claudio Persello, George Vosselman

TL;DR
RoIPoly is a novel vectorized building outline extraction method that uses vertex queries and logit embeddings to improve efficiency and reduce redundancy, outperforming existing approaches on multiple datasets.
Contribution
The paper introduces RoIPoly, a region-of-interest query-based approach with learnable logit embeddings for efficient, redundancy-free polygon extraction from aerial imagery.
Findings
Outperforms existing methods on the CrowdAI dataset, especially for small buildings.
Achieves second-best performance on 2D floorplan reconstruction in Structured3D.
Reduces computational overhead and eliminates post-processing for redundancy.
Abstract
Polygonal building outlines are crucial for geographic and cartographic applications. The existing approaches for outline extraction from aerial or satellite imagery are typically decomposed into subtasks, e.g., building masking and vectorization, or treat this task as a sequence-to-sequence prediction of ordered vertices. The former lacks efficiency, and the latter often generates redundant vertices, both resulting in suboptimal performance. To handle these issues, we propose a novel Region-of-Interest (RoI) query-based approach called RoIPoly. Specifically, we formulate each vertex as a query and constrain the query attention to the most relevant regions of a potential building, yielding reduced computational overhead and more efficient vertex-level interaction. Moreover, we introduce a novel learnable logit embedding to facilitate vertex classification on the attention map; thus, no…
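The idea of constraining a vertex query's attention to a region of interest can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `roi_attention`, the toy feature grid, and the box format are all assumptions made for illustration, and plain scaled dot-product attention stands in for whatever attention variant the paper actually uses. The point is only that the softmax runs over the cells inside the RoI, so cost scales with the box size rather than the full image.

```python
import math

def roi_attention(query, features, roi):
    """Attend from one (hypothetical) vertex query to grid cells inside an RoI.

    query:    list of d floats
    features: H x W grid, each cell a list of d floats
    roi:      (row0, col0, row1, col1), start inclusive, end exclusive
    Returns (weights, output); weights maps (row, col) -> attention weight.
    """
    r0, c0, r1, c1 = roi
    d = len(query)
    scores = {}
    for r in range(r0, r1):
        for c in range(c0, c1):
            dot = sum(q * k for q, k in zip(query, features[r][c]))
            scores[(r, c)] = dot / math.sqrt(d)  # scaled dot-product score
    if not scores:
        raise ValueError("RoI contains no cells")
    # Numerically stable softmax over the RoI cells only
    m = max(scores.values())
    exps = {pos: math.exp(s - m) for pos, s in scores.items()}
    total = sum(exps.values())
    weights = {pos: e / total for pos, e in exps.items()}
    # Weighted sum of the RoI features
    output = [
        sum(w * features[r][c][i] for (r, c), w in weights.items())
        for i in range(d)
    ]
    return weights, output

# Toy 4x4 feature map where each cell stores its own (row, col) as features
features = [[[float(r), float(c)] for c in range(4)] for r in range(4)]
weights, output = roi_attention([1.0, 0.0], features, roi=(1, 1, 3, 3))
print(sorted(weights))  # only the four cells inside the box receive weight
```

Cells outside the box never enter the softmax, which is the source of the computational saving in this toy setting.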
Taxonomy
Topics: Automated Road and Building Extraction · Image Retrieval and Classification Techniques · Remote Sensing and Land Use
Methods: Softmax · Attention Is All You Need
