UrbanGenAI: Reconstructing Urban Landscapes using Panoptic Segmentation and Diffusion Models
Timo Kapsalis

TL;DR
UrbanGenAI combines advanced image segmentation and diffusion models to enhance urban landscape reconstruction, offering a novel, high-performance tool that supports education and community planning in urban design.
Contribution
It introduces a new workflow integrating panoptic segmentation and diffusion models for urban landscape reconstruction, with a prototype demonstrating high accuracy and practical applications.
Findings
High accuracy in object detection and text-to-image generation
Effective use in educational and participatory urban planning contexts
Potential for real-time feedback and 3D integration
Abstract
In contemporary design practices, the integration of computer vision and generative artificial intelligence (genAI) represents a transformative shift towards more interactive and inclusive processes. These technologies offer new dimensions of image analysis and generation, which are particularly relevant in the context of urban landscape reconstruction. This paper presents a novel workflow encapsulated within a prototype application, designed to leverage the synergies between advanced image segmentation and diffusion models for a comprehensive approach to urban design. Our methodology encompasses the OneFormer model for detailed image segmentation and the Stable Diffusion XL (SDXL) diffusion model, implemented through ControlNet, for generating images from textual descriptions. Validation results indicated a high degree of performance by the prototype application, showcasing significant…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLand Use and Ecosystem Services · Remote Sensing and LiDAR Applications · 3D Surveying and Cultural Heritage
MethodsDiffusion · Contrastive Language-Image Pre-training
