From Intent to Execution: Multimodal Chain-of-Thought Reinforcement Learning for Precise CAD Code Generation

Ke Niu; Haiyang Yu; Zhuofan Chen; Mengyang Zhao; Teng Fu; Bin Li; Xiangyang Xue

arXiv:2508.10118·cs.LG·August 19, 2025

From Intent to Execution: Multimodal Chain-of-Thought Reinforcement Learning for Precise CAD Code Generation

Ke Niu, Haiyang Yu, Zhuofan Chen, Mengyang Zhao, Teng Fu, Bin Li, Xiangyang Xue

PDF

1 Video

TL;DR

This paper introduces CAD-RL, a multimodal reinforcement learning framework that improves the translation of natural language design intents into precise, executable CAD code, leveraging chain-of-thought reasoning and specialized optimization strategies.

Contribution

It proposes a novel multimodal Chain-of-Thought guided reinforcement learning approach for CAD code generation, along with a new dataset for training and benchmarking.

Findings

01

CAD-RL outperforms existing models in reasoning and precision.

02

The framework achieves higher code executability and geometric accuracy.

03

The dataset ExeCAD supports future research in CAD code generation.

Abstract

Computer-Aided Design (CAD) plays a vital role in engineering and manufacturing, yet current CAD workflows require extensive domain expertise and manual modeling effort. Recent advances in large language models (LLMs) have made it possible to generate code from natural language, opening new opportunities for automating parametric 3D modeling. However, directly translating human design intent into executable CAD code remains highly challenging, due to the need for logical reasoning, syntactic correctness, and numerical precision. In this work, we propose CAD-RL, a multimodal Chain-of-Thought (CoT) guided reinforcement learning post training framework for CAD modeling code generation. Our method combines CoT-based Cold Start with goal-driven reinforcement learning post training using three task-specific rewards: executability reward, geometric accuracy reward, and external evaluation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

From Intent to Execution: Multimodal Chain-of-Thought Reinforcement Learning for Precise CAD Code Generation· underline