TL;DR
ArchComplete introduces a hierarchical diffusion-based pipeline for 3D architectural design generation, capturing complex geometries and details at high resolutions through autoregressive and diffusion models.
Contribution
The paper presents a novel two-stage voxel-based generative pipeline combining autoregressive transformers and hierarchical diffusion models for detailed 3D architectural modeling.
Findings
Achieves high-resolution 3D models up to 512^3 voxels.
State-of-the-art quality and diversity in architectural generation.
Efficiently generates detailed 3D house models.
Abstract
Recent advances in 3D generative models have shown promising results but often fall short in capturing the complexity of architectural geometries and topologies and fine geometric details at high resolutions. To tackle this, we present ArchComplete, a two-stage voxel-based 3D generative pipeline consisting of a vector-quantised model, whose composition is modelled with an autoregressive transformer for generating coarse shapes, followed by a hierarchical upsampling strategy for further enrichment with fine structures and details. Key to our pipeline is (i) learning a contextually rich codebook of local patch embeddings, optimised alongside a 2.5D perceptual loss that captures global spatial correspondence of projections onto three axis-aligned orthogonal planes, and (ii) redefining upsampling as a set of conditional diffusion models learning from a hierarchy of randomly cropped…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSparse Evolutionary Training · Diffusion
