iScript: A Domain-Adapted Large Language Model and Benchmark for Physical Design Tcl Script Generation
Ning Xu, Zhaoyang Zhang, Senlin Shu, Lei Qi, Jiaqi Lv, Wensuo Wang, Tianhao Zhao, Chao Zhang, Zhaoliang Yang, Xiangyu Li, Zhaorui Su, Jingshan Li, Xin Geng

TL;DR
iScript is a domain-adapted large language model designed for generating Tcl scripts in physical design, utilizing a novel data synthesis pipeline and a comprehensive benchmark to improve reliability and performance.
Contribution
The paper introduces iScript, a domain-specific LLM for EDA scripting, along with a new benchmark and a multi-stage data synthesis pipeline for training and evaluation.
Findings
iScript outperforms state-of-the-art LLMs on the benchmark.
The data synthesis pipeline effectively generates training data.
The two-step verification improves script correctness assessment.
Abstract
Modern EDA flows rely heavily on Tcl scripting, yet general LLMs perform poorly in this domain due to extreme data scarcity, domain-specific semantics, and the high reliability required in physical design. We present iScript, a domain-adapted Qwen3-8B model for Innovus Tcl script generation, and iScript-Bench, a comprehensive benchmark covering five task categories and three difficulty levels. To overcome the lack of training data, we introduce a multi-stage data synthesis pipeline that integrates command extraction, static linting, requirement back-inference, and Chain-of-Thought generation, producing a 10K-tuple (requirement, CoT, script) dataset. iScript is trained through a two-stage strategy combining domain-adaptive pretraining and supervised fine-tuning. To evaluate script correctness efficiently, we further propose a two-step verification framework consisting of static syntax…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Topic Modeling · Natural Language Processing Techniques
