CreativEval: Evaluating Creativity of LLM-Based Hardware Code Generation
Matthew DeLorenzo, Vasudev Gohil, Jeyavijayan Rajendran

TL;DR
This paper introduces CreativeEval, a framework for quantifying the creativity of large language models in hardware code generation, focusing on aspects like originality and flexibility, and evaluates several models including GPT-3.5.
Contribution
The paper proposes a novel framework for measuring creativity in LLM-generated hardware designs, addressing a gap in existing evaluation methods.
Findings
GPT-3.5 exhibits the highest creativity among tested models.
CreativeEval effectively quantifies creativity in hardware code generation.
The framework distinguishes models based on fluency, flexibility, originality, and elaboration.
Abstract
Large Language Models (LLMs) have proved effective and efficient in generating code, leading to their utilization within the hardware design process. Prior works evaluating LLMs' abilities for register transfer level code generation solely focus on functional correctness. However, the creativity associated with these LLMs, or the ability to generate novel and unique solutions, is a metric not as well understood, in part due to the challenge of quantifying this quality. To address this research gap, we present CreativeEval, a framework for evaluating the creativity of LLMs within the context of generating hardware designs. We quantify four creative sub-components, fluency, flexibility, originality, and elaboration, through various prompting and post-processing techniques. We then evaluate multiple popular LLMs (including GPT models, CodeLlama, and VeriGen) upon this creativity metric,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Machine Learning in Materials Science · Ferroelectric and Negative Capacitance Devices
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · 15 Ways to Contact How can i speak to someone at Delta Airlines · Attention Is All You Need · Dropout · Adam · Linear Layer · Layer Normalization · Discriminative Fine-Tuning · Weight Decay · Byte Pair Encoding
