HWE-Bench: Can Language Models Perform Board-level Schematic Designs?
Weibo Qiu, Yinhao Xiao, Runyu Pan

TL;DR
HWE-Bench evaluates how well large language models can perform complex board-level schematic designs by testing their ability to generate functional circuits based on detailed specifications and datasheets.
Contribution
This paper introduces HWE-Bench, a comprehensive benchmark with 300 tasks and a datasheet knowledge base to assess LLMs' capabilities in board-level circuit design.
Findings
Current models achieve only 8.15% pass rate.
Models understand documentation but lack physical intuition.
Benchmark highlights gaps in LLMs' circuit design skills.
Abstract
Large Language Models (LLMs) have demonstrated significant potential in various engineering tasks, including software development, digital logic generation, and companion document maintenance. However, their ability to perform board-level circuit design is understudied, as this task requires a synergized understanding of real-world physics and Integrated Circuit (IC) datasheets, the latter comprising detailed specifications for individual components. To address this challenge, we propose \hweb, an evaluation framework that benchmarks the ability of LLMs to perform such designs. It consists of 300 board-level design tasks pulled from open-source and crowdsourcing platforms such as GitHub and OSHWLab, covering 8 application domains, and is complemented with a knowledge base of 2,914 real IC datasheets. For each task, the LLMs are tasked with generating a schematic from scratch, using the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Physical Unclonable Functions (PUFs) and Hardware Security · Big Data and Digital Economy
