From Flat to Spatial: Comparison of 4 methods constructing 3D, 2 and 1/2D Models from 2D Plans with neural networks
Jacob Sam, Karan Patel, Mike Saad

TL;DR
This paper compares four advanced neural network-based methods for converting 2D architectural plans and images into detailed 3D, 2.5D, and 1.5D models, highlighting their effectiveness in rapid visualization and design iteration.
Contribution
It provides a comprehensive evaluation of four novel methods for 3D model generation from 2D inputs, emphasizing their applicability in architecture and visualization.
Findings
All methods improve speed of 3D model creation.
Each method offers unique advantages in accuracy and texture fidelity.
The study guides selection of methods based on project needs.
Abstract
In the field of architecture, the conversion of single images into 2 and 1/2D and 3D meshes is a promising technology that enhances design visualization and efficiency. This paper evaluates four innovative methods: "One-2-3-45," "CRM: Single Image to 3D Textured Mesh with Convolutional Reconstruction Model," "Instant Mesh," and "Image-to-Mesh." These methods are at the forefront of this technology, focusing on their applicability in architectural design and visualization. They streamline the creation of 3D architectural models, enabling rapid prototyping and detailed visualization from minimal initial inputs, such as photographs or simple sketches.One-2-3-45 leverages a diffusion-based approach to generate multi-view reconstructions, ensuring high geometric fidelity and texture quality. CRM utilizes a convolutional network to integrate geometric priors into its architecture, producing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Surveying and Cultural Heritage · 3D Modeling in Geospatial Applications
MethodsSparse Evolutionary Training · Diffusion · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
