BenchCAD: A Comprehensive, Industry-Standard Benchmark for Programmatic CAD

Haozhe Zhang; Kaichen Liu; Miaomiao Chen; Lei Li; Shaojie Yang; Cheng Peng; Hanjie Chen

arXiv:2605.10865·cs.AI·May 13, 2026

BenchCAD: A Comprehensive, Industry-Standard Benchmark for Programmatic CAD

Haozhe Zhang, Kaichen Liu, Miaomiao Chen, Lei Li, Shaojie Yang, Cheng Peng, Hanjie Chen

PDF

TL;DR

BenchCAD is a comprehensive benchmark for evaluating multimodal AI models' ability to generate accurate, executable CAD programs across various industrial parts, highlighting current limitations and areas for improvement.

Contribution

The paper introduces BenchCAD, a large, verified dataset and evaluation framework for assessing AI models' performance in industrial CAD reasoning tasks.

Findings

01

Current models recover coarse geometry but struggle with detailed parametric accuracy.

02

Common failures include missing fine 3D structures and misinterpreting design parameters.

03

Fine-tuning improves in-distribution performance but struggles with unseen parts.

Abstract

Industrial Computer-Aided Design (CAD) code generation requires models to produce executable parametric programs from visual or textual inputs. Beyond recognizing the outer shape of a part, this task involves understanding its 3D structure, inferring engineering parameters, and choosing CAD operations that reflect how the part would be designed and manufactured. Despite the promise of Multimodal large language models (MLLMs) for this task, they are rarely evaluated on whether these capabilities jointly hold in realistic industrial CAD settings. We present BenchCAD, a unified benchmark for industrial CAD reasoning. BenchCAD contains 17,900 execution-verified CadQuery programs across 106 industrial part families, including bevel gears, compression springs, twist drills, and other reusable engineering designs. It evaluates models through visual question answering, code question answering,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.