Are LLMs Ready for Practical Adoption for Assertion Generation?
Vaishnavi Pulavarthi, Deeksha Nandal, Soham Dan, Debjit Pal

TL;DR
This paper evaluates the effectiveness of large language models in generating hardware assertions, finding current models insufficient and proposing a fine-tuned model, AssertionLLM, that improves correctness.
Contribution
The paper introduces AssertionLLM, a specialized LLM fine-tuned for assertion generation, and provides a comprehensive evaluation framework using AssertionBench.
Findings
COTS LLMs produce many incorrect assertions
AssertionLLM significantly improves assertion correctness
Current LLMs are not yet ready for practical assertion generation
Abstract
Assertions have been the de facto collateral for simulation-based and formal verification of hardware designs for over a decade. The quality of hardware verification, i.e., detection and diagnosis of corner-case design bugs, is critically dependent on the quality of the assertions. With the onset of generative AI such as Transformers and Large-Language Models (LLMs), there has been a renewed interest in developing novel, effective, and scalable techniques of generating functional and security assertions from design source code. While there have been recent works that use commercial-of-the-shelf (COTS) LLMs for assertion generation, there is no comprehensive study in quantifying the effectiveness of LLMs in generating syntactically and semantically correct assertions. In this paper, we first discuss AssertionBench from our prior work, a comprehensive set of designs and assertions to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
