Formal Methods Meet LLMs: Auditing, Monitoring, and Intervention for Compliance of Advanced AI Systems

Parand A. Alamdari; Toryn Q. Klassen; Sheila A. McIlraith

arXiv:2605.16198·cs.AI·May 18, 2026

Formal Methods Meet LLMs: Auditing, Monitoring, and Intervention for Compliance of Advanced AI Systems

Parand A. Alamdari, Toryn Q. Klassen, Sheila A. McIlraith

PDF

TL;DR

This paper combines formal methods with state-of-the-art machine learning to develop techniques for auditing and monitoring advanced AI systems, especially LLMs, to ensure compliance with safety and regulatory constraints.

Contribution

It introduces novel offline and online monitoring methods using Linear Temporal Logic to detect and mitigate violations in black-box AI systems, improving safety and compliance.

Findings

01

Formal syntax enhances violation detection over baseline methods.

02

Predictive monitors reduce violation rates significantly.

03

LLMs' temporal reasoning degrades with complexity and event distance.

Abstract

We examine one particular dimension of AI governance: how to monitor and audit AI-enabled products and services throughout the AI development lifecycle, from pre-deployment testing to post-deployment auditing. Combining principles from formal methods with SoTA machine learning, we propose techniques that enable AI-enabled product and service developers, as well as third party AI developers and evaluators, to perform offline auditing and online (runtime) monitoring of product-specific (temporally extended) behavioral constraints such as safety constraints, norms, rules and regulations with respect to black-box advanced AI systems, notably LLMs. We further provide practical techniques for predictive monitoring, such as sampling-based methods, and we introduce intervening monitors that act at runtime to preempt and potentially mitigate predicted violations. Experimental results show that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.