ChipCraftBrain: Validation-First RTL Generation via Multi-Agent Orchestration
Cagri Eryilmaz

TL;DR
ChipCraftBrain is a multi-agent framework that combines symbolic-neural reasoning and adaptive orchestration to improve RTL code generation from natural language, achieving high correctness on benchmarks and FPGA validation.
Contribution
It introduces a novel adaptive multi-agent orchestration with hybrid reasoning, knowledge augmentation, and hierarchical decomposition for more accurate RTL generation.
Findings
Achieves 97.2% pass@1 on VerilogEval-Human, matching state-of-the-art.
Reaches 94.7% pass@1 on CVDP, outperforming single-shot baselines.
Successfully generates FPGA-validated RISC-V SoC modules with full correctness.
Abstract
Large Language Models (LLMs) show promise for generating Register-Transfer Level (RTL) code from natural language specifications, but single-shot generation achieves only 60-65% functional correctness on standard benchmarks. Multi-agent approaches such as MAGE reach 95.9% on VerilogEval yet remain untested on harder industrial benchmarks such as NVIDIA's CVDP, lack synthesis awareness, and incur high API costs. We present ChipCraftBrain, a framework combining symbolic-neural reasoning with adaptive multi-agent orchestration for automated RTL generation. Four innovations drive the system: (1) adaptive orchestration over six specialized agents via a PPO policy over a 168-dim state (an alternative world-model MPC planner is also evaluated); (2) a hybrid symbolic-neural architecture that solves K-map and truth-table problems algorithmically while specialized agents handle waveform timing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
