STEP-LLM: Generating CAD STEP Models from Natural Language with Large Language Models
Xiangyu Shi, Junyang Ding, Xu Zhao, Sinong Zhan, Payal Mohapatra, Daniel Quispe, Kojo Welbeck, Jian Cao, Wei Chen, Ping Guo, Qi Zhu

TL;DR
This paper introduces STEP-LLM, a large language model fine-tuned to generate CAD models in the universal STEP format from natural language, overcoming format-specific limitations and improving geometric accuracy.
Contribution
The paper presents a novel dataset, preprocessing techniques, and a multi-stage framework including retrieval, reserialization, and reinforcement learning to generate accurate STEP models from text.
Findings
STEP-LLM outperforms Text2CAD in geometric fidelity.
Retrieval-augmented generation improves model completeness.
Reserialization and reinforcement learning enhance accuracy and reduce geometric discrepancies.
Abstract
Computer-aided design (CAD) is vital to modern manufacturing, yet model creation remains labor-intensive and expertise-heavy. To enable non-experts to translate intuitive design intent into manufacturable artifacts, recent large language models-based text-to-CAD efforts focus on command sequences or script-based formats like CadQuery. However, these formats are kernel-dependent and lack universality for manufacturing. In contrast, the Standard for the Exchange of Product Data (STEP, ISO 10303) file is a widely adopted, neutral boundary representation (B-rep) format directly compatible with manufacturing, but its graph-structured, cross-referenced nature poses unique challenges for auto-regressive LLMs. To address this, we curate a dataset of ~40K STEP-caption pairs and introduce novel preprocessing tailored for the graph-structured format of STEP, including a depth-first search-based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsManufacturing Process and Optimization · 3D Shape Modeling and Analysis · Design Education and Practice
