AutoVCoder: A Systematic Framework for Automated Verilog Code Generation using LLMs
Mingzhe Gao, Jieru Zhao, Zhe Lin, Wenchao Ding, Xiaofeng, Hou, Yu Feng, Chao Li, Minyi Guo

TL;DR
AutoVCoder is a comprehensive framework that significantly enhances the correctness and quality of Verilog code generated by LLMs through novel data, fine-tuning, and retrieval techniques.
Contribution
It introduces a systematic framework with three novel techniques to improve LLM-based Verilog code generation accuracy and quality.
Findings
Outperforms existing LLMs in Verilog correctness benchmarks.
Achieves 0.5%-2.2% improvement in functional correctness.
Increases syntax and functional correctness by 3.4% on RTLLM.
Abstract
Recently, the use of large language models (LLMs) for software code generation, e.g., C/C++ and Python, has proven a great success. However, LLMs still suffer from low syntactic and functional correctness when it comes to the generation of register-transfer level (RTL) code, such as Verilog. To address this issue, in this paper, we develop AutoVCoder, a systematic open-source framework that significantly improves the LLMs' correctness of generating Verilog code and enhances the quality of its output at the same time. Our framework integrates three novel techniques, including a high-quality hardware dataset generation approach, a two-round LLM fine-tuning method and a domain-specific retrieval-augmented generation (RAG) mechanism. Experimental results demonstrate that AutoVCoder outperforms both industrial and academic LLMs in Verilog code generation. Specifically, AutoVCoder shows a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReal-time simulation and control systems
