Formal Methods Meet LLMs: Auditing, Monitoring, and Intervention for Compliance of Advanced AI Systems
Parand A. Alamdari, Toryn Q. Klassen, Sheila A. McIlraith

TL;DR
This paper combines formal methods with state-of-the-art machine learning to develop techniques for auditing and monitoring advanced AI systems, especially LLMs, to ensure compliance with safety and regulatory constraints.
Contribution
It introduces novel offline and online monitoring methods using Linear Temporal Logic to detect and mitigate violations in black-box AI systems, improving safety and compliance.
Findings
Formal syntax enhances violation detection over baseline methods.
Predictive monitors reduce violation rates significantly.
LLMs' temporal reasoning degrades with complexity and event distance.
Abstract
We examine one particular dimension of AI governance: how to monitor and audit AI-enabled products and services throughout the AI development lifecycle, from pre-deployment testing to post-deployment auditing. Combining principles from formal methods with SoTA machine learning, we propose techniques that enable AI-enabled product and service developers, as well as third party AI developers and evaluators, to perform offline auditing and online (runtime) monitoring of product-specific (temporally extended) behavioral constraints such as safety constraints, norms, rules and regulations with respect to black-box advanced AI systems, notably LLMs. We further provide practical techniques for predictive monitoring, such as sampling-based methods, and we introduce intervening monitors that act at runtime to preempt and potentially mitigate predicted violations. Experimental results show that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
