GenCAD: Image-Conditioned Computer-Aided Design Generation with Transformer-Based Contrastive Representation and Diffusion Priors
Md Ferdous Alam, Faez Ahmed

TL;DR
GenCAD is a novel generative model that converts images into editable 3D CAD models using transformer-based contrastive learning and diffusion priors, improving CAD model generation and retrieval.
Contribution
It introduces a new image-to-CAD generation framework combining transformers, contrastive learning, and diffusion models, enabling editable and retrievable CAD models from images.
Findings
Outperforms existing methods in CAD model generation
Enables image-based CAD model retrieval from large databases
Facilitates faster and more intuitive CAD design processes
Abstract
The creation of manufacturable and editable 3D shapes through Computer-Aided Design (CAD) remains a highly manual and time-consuming task, hampered by the complex topology of boundary representations of 3D solids and unintuitive design tools. While most work in the 3D shape generation literature focuses on representations like meshes, voxels, or point clouds, practical engineering applications demand the modifiability and manufacturability of CAD models and the ability for multi-modal conditional CAD model generation. This paper introduces GenCAD, a generative model that employs autoregressive transformers with a contrastive learning framework and latent diffusion models to transform image inputs into parametric CAD command sequences, resulting in editable 3D shape representations. Extensive evaluations demonstrate that GenCAD significantly outperforms existing state-of-the-art methods…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsManufacturing Process and Optimization · Image Processing and 3D Reconstruction · BIM and Construction Integration
MethodsContrastive Learning · Diffusion
