Point2Primitive: CAD Reconstruction from Point Cloud by Direct Primitive Prediction
Xinzhu Ma, Cheng Wang, Chen Tang, Bin Wang, Shixiang Tang, Yuan Meng, Yunhong Wang, Di Huang

TL;DR
Point2Primitive introduces a direct primitive prediction framework for CAD reconstruction from point clouds, improving accuracy and editability over traditional implicit neural methods by explicitly predicting sketch primitives with a transformer-based decoder.
Contribution
It presents a novel approach that directly predicts explicit CAD primitives from point clouds using an improved transformer decoder, enhancing precision and topological reconstruction.
Findings
Outperforms implicit methods in primitive accuracy
Achieves higher geometric fidelity
Effectively reconstructs CAD topology via extrusion segmentation
Abstract
Recovering CAD models from point clouds requires reconstructing their topology and sketch-based extrusion primitives. A dominant paradigm for representing sketches involves implicit neural representations such as Signed Distance Fields (SDFs). However, this indirect approach inherently struggles with precision, leading to unintended curved edges and models that are difficult to edit. In this paper, we propose Point2Primitive, a framework that learns to directly predict the explicit, parametric primitives of CAD models. Our method treats sketch reconstruction as a set prediction problem, employing a improved transformer-based decoder with explicit position queries to directly detect and predict the fundamental sketch curves (i.e., type and parameter) from the point cloud. Instead of approximating a continuous field, we formulate curve parameters as explicit position queries, which are…
Peer Reviews
Decision·Submitted to ICLR 2026
- The paper tackles an important problem that has practical relevance. - Replacing the sketch-SDF representation with a transformer is sensible and should work better in practice. - The paper delivers this message clearly.
I find this to be a poor paper, primarily because of the following: - This paper only has 30 references. Even the Point2Cyl written 4-5 years ago has 56 citations and this field has been rapidly growing since then. The paper just ignores a large body of works including, but absolutely not limited to: * Rukhovich, Danila, et al. "Cad-recode: Reverse engineering cad code from point clouds." ICCV 2025. * Dupont, Elona, et al. "Transcad: A hierarchical transformer for cad sequence infe
1. A parametric CAD modeling method is proposed that uses multiple 2D primitives (rather than a single one such as a circle), resulting in better modeling precision of CAD shapes. 2. Several 3D deep learning components are introduced to segment the input point clouds into regions corresponding to different extrusions and to predict the barrel points for parameterizing 2D sketches. A transformer module is also incorporated to refine the curve parameters. 3. The reconstruction accuracy is superior
1. The paper is not well presented. it requires considerable effort to follow its mathematical notations, which is used extensively throughout the work with many subscripts and superscripts. 2. How does the quality of the input point cloud affect the reconstruction performance of the proposed method, particularly under varying levels of data sparsity, incompleteness, or noise? 3. While the method is well tailored for CAD modeling, its reliance on predefined geometric primitives limits its genera
1. The paper identifies three concrete limitations of SDF-based methods (lack of semantic structure, blurred edges, difficult conversion) and proposes a compelling alternative through direct primitive prediction. The position-as-query formulation elegantly integrates geometric priors into the transformer decoder. 2. The evaluation includes multiple datasets (DeepCAD, Fusion 360 Gallery), diverse baselines (SDF methods, LM-based generation, primitive fitting), and importantly, an augmented datase
1. Supporting only three basic curve types (line, arc, circle) is a critical limitation. Real CAD models frequently use splines, ellipses, and B-splines. The paper acknowledges this but provides no analysis of what percentage of real CAD models can actually be represented with these three types. This fundamentally questions the practical applicability. 2. The method only handles sketch-and-extrude operations, excluding revolve, sweep, loft, boolean operations, fillets, etc. 3. Missing critical a
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Surveying and Cultural Heritage · 3D Shape Modeling and Analysis · Remote Sensing and LiDAR Applications
