Neurosymbolic Auditing of Natural-Language Software Requirements
Bethel Hall, William Eiers

TL;DR
This paper introduces VERIMED, a neurosymbolic pipeline using large language models and SMT solvers to audit natural-language software requirements, reducing ambiguity and detecting safety violations.
Contribution
The paper presents a novel neurosymbolic approach combining LLMs and formal methods for effective auditing of natural-language requirements, especially in safety-critical domains.
Findings
Stochastic variation in formalizations signals ambiguity in requirements.
Counterexample-guided repair significantly improves requirement accuracy.
VERIMED reduces ambiguity and enhances safety verification in medical-device software.
Abstract
Natural-language software requirements are often ambiguous, inconsistent, and underspecified; in safety-critical domains, these defects propagate into formal models that verify the wrong specification and into implementations that ship unsafe behavior. We show that large language models, equipped with an SMT solver, can audit such requirements: translating them into formal logic, detecting ambiguity through stochastic variation in the generated formalization, and exposing inconsistency, vacuousness, and safety violations through solver queries on the resulting specification. We present VERIMED, a neurosymbolic pipeline that operationalizes this idea for medical-device software requirements, and report two findings. First, stochastic variation across independent formalizations is a signal of ambiguity: requirements that admit multiple plausible interpretations produce SMT-inequivalent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
