TL;DR
Raster2Seq is a novel sequence-to-sequence approach for reconstructing detailed, structured floorplan vector graphics from raster images, effectively handling complex indoor layouts with many rooms and diverse geometries.
Contribution
It introduces an autoregressive decoder with learnable anchors for flexible, accurate polygon sequence generation from floorplan images, advancing state-of-the-art performance.
Findings
Achieves state-of-the-art results on Structure3D, CubiCasa5K, and Raster2Graph datasets.
Demonstrates strong generalization to diverse and complex datasets like WAFFLE.
Abstract
Reconstructing a structured vector-graphics representation from a rasterized floorplan image is typically an important prerequisite for computational tasks involving floorplans such as automated understanding or CAD workflows. However, existing techniques struggle in faithfully generating the structure and semantics conveyed by complex floorplans that depict large indoor spaces with many rooms and a varying numbers of polygon corners. To this end, we propose Raster2Seq, framing floorplan reconstruction as a sequence-to-sequence task in which floorplan elements--such as rooms, windows, and doors--are represented as labeled polygon sequences that jointly encode geometry and semantics. Our approach introduces an autoregressive decoder that learns to predict the next corner conditioned on image features and previously generated corners using guidance from learnable anchors. These anchors…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
