Sketch2CAD: 3D CAD Model Reconstruction from 2D Sketch using Visual   Transformer

Hong-Bin Yang

arXiv:2309.16850·cs.CV·February 21, 2025·1 cites

Sketch2CAD: 3D CAD Model Reconstruction from 2D Sketch using Visual Transformer

Hong-Bin Yang

PDF

Open Access

TL;DR

This paper introduces a novel 3D reconstruction method from 2D sketches that produces CAD-compatible B-rep models, enabling more accurate and editable 3D models compared to traditional voxel or mesh-based methods.

Contribution

The paper proposes a visual transformer-based approach to predict scene descriptors from 2D sketches for CAD-compatible 3D reconstruction, addressing limitations of existing data formats.

Findings

01

Accurately reconstructs simple scenes in 3D.

02

Demonstrates potential for editable CAD models.

03

Faces challenges with complex scenes.

Abstract

Current 3D reconstruction methods typically generate outputs in the form of voxels, point clouds, or meshes. However, each of these formats has inherent limitations, such as rough surfaces and distorted structures. Additionally, these data types are not ideal for further manual editing and post-processing. In this paper, we present a novel 3D reconstruction method designed to overcome these disadvantages by reconstructing CAD-compatible models. We trained a visual transformer to predict a "scene descriptor" from a single 2D wire-frame image. This descriptor includes essential information, such as object types and parameters like position, rotation, and size. Using the predicted parameters, a 3D scene can be reconstructed with 3D modeling software that has programmable interfaces, such as Rhino Grasshopper, to build highly editable 3D models in the form of B-rep. To evaluate our proposed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Surveying and Cultural Heritage · Advanced Vision and Imaging · Robotics and Sensor-Based Localization

MethodsRoIAlign · RoIPool · Softmax