CircuChain: Disentangling Competence and Compliance in LLM Circuit Analysis
Mayank Ravishankara

TL;DR
CircuChain is a benchmark that evaluates whether large language models genuinely understand circuit physics or merely follow training priors, revealing a divergence between reasoning ability and instruction compliance.
Contribution
It introduces a diagnostic framework with a multi-stage verification pipeline to disentangle physical reasoning from convention adherence in LLMs for circuit analysis.
Findings
Strong models excel in physics reasoning but often violate conventions.
Weaker models follow instructions better but lack physical fidelity.
Increased capability does not necessarily improve constraint adherence.
Abstract
As large language models (LLMs) advance toward expert-level performance in engineering domains, reliable reasoning under user-specified constraints becomes critical. In circuit analysis, for example, a numerically correct solution is insufficient if it violates established methodological conventions such as mesh directionality or polarity assignments, errors that can propagate in safety-critical systems. Yet it remains unclear whether frontier models truly apply first-principles reasoning or rely on entrenched training priors that conflict with explicit instructions. We introduce CircuChain, a diagnostic benchmark designed to disentangle instruction compliance from physical reasoning competence in electrical circuit analysis. CircuChain consists of counterbalanced Control/Trap problem pairs across five canonical circuit topologies, augmented with systematic variations in sign…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
