CityCraft: A Real Crafter for 3D City Generation
Jie Deng, Wenhao Chai, Junsheng Huang, Zhonghan Zhao, Qixuan Huang, Mingyan Gao, Jianshu Guo, Shengyu Hao, Wenhao Hu, Jenq-Neng Hwang, Xi Li, Gaoang Wang

TL;DR
CityCraft introduces a novel framework for generating diverse, realistic 3D city scenes by combining diffusion transformers, language-guided planning, and asset retrieval, advancing urban scene synthesis for applications like autonomous driving and city planning.
Contribution
The paper presents a new multi-stage approach integrating diffusion transformers, language models, and asset placement, along with two datasets, to improve diversity and realism in 3D city generation.
Findings
Achieves state-of-the-art realism in 3D city generation
Provides controllable and diverse urban scene layouts
Introduces two new datasets for urban scene synthesis
Abstract
City scene generation has gained significant attention in autonomous driving, smart city development, and traffic simulation. It helps enhance infrastructure planning and monitoring solutions. Existing methods have employed a two-stage process involving city layout generation, typically using Variational Autoencoders (VAEs), Generative Adversarial Networks (GANs), or Transformers, followed by neural rendering. These techniques often exhibit limited diversity and noticeable artifacts in the rendered city scenes. The rendered scenes lack variety, resembling the training images, resulting in monotonous styles. Additionally, these methods lack planning capabilities, leading to less realistic generated scenes. In this paper, we introduce CityCraft, an innovative framework designed to enhance both the diversity and quality of urban scene generation. Our approach integrates three key stages:…
Taxonomy
Topics: Augmented Reality Applications · 3D Shape Modeling and Analysis · Advanced Manufacturing and Logistics Optimization
Methods: Softmax · RoIAlign · Diffusion · RoIPool
