WAFFLE: Multimodal Floorplan Understanding in the Wild
Keren Ganon, Morris Alper, Rachel Mikulinsky, Hadar Averbuch-Elor

TL;DR
WAFFLE introduces a large, diverse multimodal dataset of nearly 20,000 floorplans and metadata, enabling new research in building understanding tasks that were previously infeasible.
Contribution
The paper presents WAFFLE, a comprehensive multimodal dataset for floorplan understanding, and demonstrates its utility for advancing building semantics analysis.
Findings
WAFFLE enables new discriminative and generative building understanding tasks.
The dataset covers diverse building types, locations, and data formats.
A large language model and multimodal foundation models are used to curate the dataset and extract semantic information from the floorplan images.
Abstract
Buildings are a central feature of human culture and are increasingly being analyzed with computational methods. However, recent works on computational building understanding have largely focused on natural imagery of buildings, neglecting the fundamental element defining a building's structure -- its floorplan. Conversely, existing works on floorplan understanding are extremely limited in scope, often focusing on floorplans of a single semantic category and region (e.g. floorplans of apartments from a single country). In this work, we introduce WAFFLE, a novel multimodal floorplan understanding dataset of nearly 20K floorplan images and metadata curated from Internet data spanning diverse building types, locations, and data formats. By using a large language model and multimodal foundation models, we curate and extract semantic information from these images and their accompanying noisy…
Taxonomy
Topics: Constraint Satisfaction and Optimization · Speech and dialogue systems · Semantic Web and Ontologies
