HSCO-Bench: An Agent-Driven End-to-End Hardware-Software Co-design Benchmark for Systems-on-Chip
Pei-Huan Tsai, Kuan-Lin Chiu, William Baisi, Pin-Yu Chen, Luca P. Carloni

TL;DR
HSCO-Bench is a novel benchmark designed to evaluate large language models' ability to perform end-to-end hardware-software co-design for heterogeneous SoC systems, highlighting current limitations and potential for future optimization.
Contribution
This paper introduces HSCO-Bench, the first comprehensive benchmark for evaluating LLMs in full hardware-software co-design of heterogeneous SoCs.
Findings
Only two of five frontier models could generate valid SoC prototypes.
Generated designs are far from optimal, with peak speedup of 16.22X and resource utilization of 23.67%.
End-to-end integration remains challenging for current models.
Abstract
Large language models (LLMs) are adopted for software and hardware design, yet these domains are still evaluated separately. Software benchmarks typically assume fixed hardware targets, while hardware benchmarks focus on component-level optimization without considering the full hardware-software stack. Consequently, no existing benchmark evaluates whether an LLM agent can perform end-to-end, system-level hardware-software co-design. Such a process requires: 1) analyzing applications to identify kernels requiring acceleration, 2) designing and integrating heterogeneous accelerators into a System-on-Chip (SoC) under resource constraints, and 3) mapping kernels onto the generated accelerators. We present HSCO-Bench, an end-to-end hardware-software co-design benchmark for accelerator-rich heterogeneous SoC generation. Built upon an open-source SoC platform with a curated repository…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
