AutoVCoder: A Systematic Framework for Automated Verilog Code Generation   using LLMs

Mingzhe Gao; Jieru Zhao; Zhe Lin; Wenchao Ding; Xiaofeng; Hou; Yu Feng; Chao Li; Minyi Guo

arXiv:2407.18333·cs.AR·July 29, 2024·2 cites

AutoVCoder: A Systematic Framework for Automated Verilog Code Generation using LLMs

Mingzhe Gao, Jieru Zhao, Zhe Lin, Wenchao Ding, Xiaofeng, Hou, Yu Feng, Chao Li, Minyi Guo

PDF

Open Access

TL;DR

AutoVCoder is a comprehensive framework that significantly enhances the correctness and quality of Verilog code generated by LLMs through novel data, fine-tuning, and retrieval techniques.

Contribution

It introduces a systematic framework with three novel techniques to improve LLM-based Verilog code generation accuracy and quality.

Findings

01

Outperforms existing LLMs in Verilog correctness benchmarks.

02

Achieves 0.5%-2.2% improvement in functional correctness.

03

Increases syntax and functional correctness by 3.4% on RTLLM.

Abstract

Recently, the use of large language models (LLMs) for software code generation, e.g., C/C++ and Python, has proven a great success. However, LLMs still suffer from low syntactic and functional correctness when it comes to the generation of register-transfer level (RTL) code, such as Verilog. To address this issue, in this paper, we develop AutoVCoder, a systematic open-source framework that significantly improves the LLMs' correctness of generating Verilog code and enhances the quality of its output at the same time. Our framework integrates three novel techniques, including a high-quality hardware dataset generation approach, a two-round LLM fine-tuning method and a domain-specific retrieval-augmented generation (RAG) mechanism. Experimental results demonstrate that AutoVCoder outperforms both industrial and academic LLMs in Verilog code generation. Specifically, AutoVCoder shows a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReal-time simulation and control systems