RTL-Repo: A Benchmark for Evaluating LLMs on Large-Scale RTL Design Projects
Ahmed Allam, Mohamed Shalan

TL;DR
RTL-Repo is a comprehensive benchmark dataset of over 4000 Verilog samples from GitHub, designed to evaluate and improve large language models' performance in complex, real-world RTL hardware design tasks.
Contribution
This paper introduces RTL-Repo, the first large-scale, publicly available benchmark dataset for assessing LLMs on real-world RTL design projects.
Findings
GPT-4 outperforms other models in Verilog code generation
VeriGen and RTLCoder show strengths in specific RTL tasks
Benchmark results highlight gaps in current LLM capabilities for RTL design
Abstract
Large Language Models (LLMs) have demonstrated potential in assisting with Register Transfer Level (RTL) design tasks. Nevertheless, there remains to be a significant gap in benchmarks that accurately reflect the complexity of real-world RTL projects. To address this, this paper presents RTL-Repo, a benchmark specifically designed to evaluate LLMs on large-scale RTL design projects. RTL-Repo includes a comprehensive dataset of more than 4000 Verilog code samples extracted from public GitHub repositories, with each sample providing the full context of the corresponding repository. We evaluate several state-of-the-art models on the RTL-Repo benchmark, including GPT-4, GPT-3.5, Starcoder2, alongside Verilog-specific models like VeriGen and RTLCoder, and compare their performance in generating Verilog code for complex projects. The RTL-Repo benchmark provides a valuable resource for the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Storage Technologies · Advanced Database Systems and Queries · Distributed and Parallel Computing Systems
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · 15 Ways to Contact How can i speak to someone at Delta Airlines · Attention Is All You Need · Label Smoothing · Adam · Position-Wise Feed-Forward Layer · Dropout · Dense Connections · Absolute Position Encodings · Softmax
