TL;DR
This paper introduces a hierarchical, geometry-aware graph representation for text-to-CAD code generation, improving geometric accuracy and constraint satisfaction in complex assemblies.
Contribution
It proposes a novel graph-based intermediate representation and a curriculum learning strategy to enhance structure understanding and code generation accuracy.
Findings
Outperforms existing methods in geometric fidelity.
Achieves higher constraint satisfaction accuracy.
Builds a new 12K dataset with annotations and metrics.
Abstract
Text-to-CAD code generation is a long-horizon task that translates textual instructions into long sequences of interdependent operations. Existing methods typically decode text directly into executable code (e.g., bpy) without explicitly modeling assembly hierarchy or geometric constraints, which enlarges the search space, accumulates local errors, and often causes cascading failures in complex assemblies. To address this issue, we propose a hierarchical and geometry-aware graph as an intermediate representation. The graph models multi-level parts and components as nodes and encodes explicit geometric constraints as edges. Instead of mapping text directly to code, our framework first predicts structure and constraints, then conditions action sequencing and code generation, thereby improving geometric fidelity and constraint satisfaction. We further introduce a structure-aware…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
