LLM4Cov: Execution-Aware Agentic Learning for High-coverage Testbench Generation

Hejia Zhang; Zhongming Yu; Chia-Tung Ho; Haoxing Ren; Brucek Khailany; Jishen Zhao

arXiv:2602.16953·cs.AI·February 27, 2026

LLM4Cov: Execution-Aware Agentic Learning for High-coverage Testbench Generation

Hejia Zhang, Zhongming Yu, Chia-Tung Ho, Haoxing Ren, Brucek Khailany, Jishen Zhao

PDF

Open Access

TL;DR

This paper introduces LLM4Cov, an offline learning framework for high-coverage hardware verification that models verification as memoryless state transitions, enabling scalable learning with execution constraints and achieving high coverage with a compact model.

Contribution

The paper presents a novel offline agent-learning framework for verification, including data curation, policy-aware synthesis, and prioritized sampling, tailored for execution-aware LLM agents.

Findings

01

A 4B-parameter model achieves 69.2% coverage pass rate.

02

Outperforms its teacher model by 5.3%.

03

Demonstrates competitive performance with larger models.

Abstract

Execution-aware LLM agents offer a promising paradigm for learning from tool feedback, but such feedback is often expensive and slow to obtain, making online reinforcement learning (RL) impractical. High-coverage hardware verification exemplifies this challenge due to its reliance on industrial simulators and non-differentiable execution signals. We propose LLM4Cov, an offline agent-learning framework that models verification as memoryless state transitions guided by deterministic evaluators. Building on this formulation, we introduce execution-validated data curation, policy-aware agentic data synthesis, and worst-state-prioritized sampling to enable scalable learning under execution constraints. We further curate a reality-aligned benchmark adapted from an existing verification suite through a revised evaluation protocol. Using the proposed pipeline, a compact 4B-parameter model…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Formal Methods in Verification · Software Testing and Debugging Techniques