Towards Neuro-symbolic Causal Rule Synthesis, Verification, and Evaluation Grounded in Legal and Safety Principles

Zainab Rehan; Christian Medeiros Adriano; Sona Ghahremani; Holger Giese

arXiv:2604.28087 · cs.LO · May 12, 2026

TL;DR

This paper extends a neuro-symbolic causal framework with LLM-based, goal-driven rule synthesis and verification, aiming to improve the scalability and safety of rule-based AI systems in safety-critical domains.

Contribution

It introduces a meta-level layer with a Goal/Rule Synthesizer and Verification Engine that iteratively refines formal rules from natural language goals using LLMs.

Findings

01. Successfully derived minimal rule sets in autonomous driving scenarios

02. Demonstrated formalization of rules as logical constraints

03. Supported incremental and traceable rule synthesis grounded in safety principles

Abstract

Rule-based systems remain central in safety-critical domains but often struggle with scalability, brittleness, and goal misspecification. These limitations can lead to reward hacking and failures in formal verification, as AI systems tend to optimize for narrow objectives. In previous research, we developed a neuro-symbolic causal framework that integrates first-order logic abduction trees, structural causal models, and deep reinforcement learning within a MAPE-K loop to provide explainable adaptations under distribution shifts. In this paper, we extend that framework by introducing a meta-level layer designed to mitigate goal misspecification and support scalable rule maintenance. This layer consists of a Goal/Rule Synthesizer and a Rule Verification Engine, which iteratively refine a formal rule theory from high-level natural-language goals and principles provided by human experts.…
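
The synthesize-and-verify loop described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the LLM-based synthesizer is replaced by a fixed list of hypothetical candidate rules, the Rule Verification Engine by brute-force enumeration over three Boolean variables, and every name (`Rule`, `is_unsafe`, `refine`, the driving variables) is an assumption made for illustration. It only shows the general shape of counterexample-driven refinement, where a rule is added when the verifier finds an unsafe state the current theory still permits.

```python
from dataclasses import dataclass
from itertools import product
from typing import Callable, Dict, Iterator, List, Optional

State = Dict[str, bool]

@dataclass(frozen=True)
class Rule:
    name: str
    goal: str                       # natural-language goal the rule traces back to
    holds: Callable[[State], bool]  # rule expressed as a logical constraint

# Hypothetical state variables for a toy driving scenario.
VARIABLES = ["pedestrian_ahead", "braking", "speeding"]

def all_states() -> Iterator[State]:
    """Enumerate every truth assignment over VARIABLES."""
    for values in product([False, True], repeat=len(VARIABLES)):
        yield dict(zip(VARIABLES, values))

def is_unsafe(s: State) -> bool:
    """Assumed formal reading of the expert's high-level safety goal."""
    return (s["pedestrian_ahead"] and not s["braking"]) or s["speeding"]

# Stand-in for the synthesizer's proposals (an LLM in the paper).
CANDIDATES = [
    Rule("brake_implies_slow", "drive smoothly",
         lambda s: (not s["braking"]) or (not s["speeding"])),
    Rule("brake_for_pedestrian", "protect pedestrians",
         lambda s: (not s["pedestrian_ahead"]) or s["braking"]),
    Rule("no_speeding", "obey speed limits",
         lambda s: not s["speeding"]),
]

def find_counterexample(theory: List[Rule]) -> Optional[State]:
    """Verifier: an unsafe state that the current theory still allows."""
    for s in all_states():
        if all(r.holds(s) for r in theory) and is_unsafe(s):
            return s
    return None

def is_consistent(theory: List[Rule]) -> bool:
    """The theory must admit at least one state."""
    return any(all(r.holds(s) for r in theory) for s in all_states())

def propose_rule(cex: State, theory: List[Rule]) -> Optional[Rule]:
    """Synthesizer stub: first unused candidate that excludes the counterexample."""
    for r in CANDIDATES:
        if r not in theory and not r.holds(cex):
            return r
    return None

def refine(max_rounds: int = 10) -> List[Rule]:
    """Add rules one at a time until no unsafe state remains permitted."""
    theory: List[Rule] = []
    for _ in range(max_rounds):
        cex = find_counterexample(theory)
        if cex is None:
            if not is_consistent(theory):
                raise ValueError("theory is unsatisfiable")
            return theory
        rule = propose_rule(cex, theory)
        if rule is None:
            raise ValueError(f"no candidate rule excludes {cex}")
        theory.append(rule)
    raise RuntimeError("refinement did not converge")

if __name__ == "__main__":
    for r in refine():
        print(f"{r.name}  <-  goal: {r.goal}")
```

In this toy setting the loop selects only the two rules needed to exclude every unsafe state and never picks the redundant `brake_implies_slow` candidate. Each selected rule keeps a `goal` field, which is one simple way to keep synthesis traceable to the originating principle. A realistic system would replace the enumeration with a formal verifier and the candidate list with generated rules.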
