ScaleRTL: Scaling LLMs with Reasoning Data and Test-Time Compute for Accurate RTL Code Generation

Chenhui Deng; Yun-Da Tsai; Guan-Ting Liu; Zhongzhi Yu; Haoxing Ren

arXiv:2506.05566·cs.AR·July 17, 2025

ScaleRTL: Scaling LLMs with Reasoning Data and Test-Time Compute for Accurate RTL Code Generation

Chenhui Deng, Yun-Da Tsai, Guan-Ting Liu, Zhongzhi Yu, Haoxing Ren

PDF

Open Access

TL;DR

ScaleRTL is a reasoning large language model designed for RTL code generation, leveraging extensive reasoning data and test-time scaling to significantly improve performance over existing models.

Contribution

The paper introduces ScaleRTL, the first reasoning LLM for RTL coding that scales reasoning data and test-time compute, enabling deeper RTL reasoning and better code generation.

Findings

01

Achieves state-of-the-art results on VerilogEval and RTLLM benchmarks.

02

Outperforms 18 baselines by up to 18.4% on VerilogEval.

03

Utilizes a large dataset of 3.5B tokens with chain-of-thought reasoning traces.

Abstract

Recent advances in large language models (LLMs) have enabled near-human performance on software coding benchmarks, but their effectiveness in RTL code generation remains limited due to the scarcity of high-quality training data. While prior efforts have fine-tuned LLMs for RTL tasks, they do not fundamentally overcome the data bottleneck and lack support for test-time scaling due to their non-reasoning nature. In this work, we introduce ScaleRTL, the first reasoning LLM for RTL coding that scales up both high-quality reasoning data and test-time compute. Specifically, we curate a diverse set of long chain-of-thought reasoning traces averaging 56K tokens each, resulting in a dataset of 3.5B tokens that captures rich RTL knowledge. Fine-tuning a general-purpose reasoning model on this corpus yields ScaleRTL that is capable of deep RTL reasoning. Subsequently, we further enhance the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel-Driven Software Engineering Techniques · Natural Language Processing Techniques · Software Testing and Debugging Techniques

MethodsSparse Evolutionary Training