Towards Neuro-symbolic Causal Rule Synthesis, Verification, and Evaluation Grounded in Legal and Safety Principles
Zainab Rehan, Christian Medeiros Adriano, Sona Ghahremani, Holger Giese

TL;DR
This paper presents an extension to a neuro-symbolic causal framework that uses LLMs for goal-driven rule synthesis and verification, enhancing scalability and safety in safety-critical AI systems.
Contribution
It introduces a meta-level layer with a Goal/Rule Synthesizer and Verification Engine that iteratively refines formal rules from natural language goals using LLMs.
Findings
Successfully derived minimal rule sets in autonomous driving scenarios
Demonstrated formalization of rules as logical constraints
Supported incremental and traceable rule synthesis grounded in safety principles
Abstract
Rule-based systems remain central in safety-critical domains but often struggle with scalability, brittleness, and goal misspecification. These limitations can lead to reward hacking and failures in formal verification, as AI systems tend to optimize for narrow objectives. In previous research, we developed a neuro-symbolic causal framework that integrates first-order logic abduction trees, structural causal models, and deep reinforcement learning within a MAPE-K loop to provide explainable adaptations under distribution shifts. In this paper, we extend that framework by introducing a meta-level layer designed to mitigate goal misspecification and support scalable rule maintenance. This layer consists of a Goal/Rule Synthesizer and a Rule Verification Engine, which iteratively refine a formal rule theory from high-level natural-language goals and principles provided by human experts.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
