Emergent Formal Verification: How an Autonomous AI Ecosystem Independently Discovered SMT-Based Safety Across Six Domains

Octavian Untila

arXiv:2603.21149·cs.SE·March 27, 2026

Emergent Formal Verification: How an Autonomous AI Ecosystem Independently Discovered SMT-Based Safety Across Six Domains

Octavian Untila

PDF

Open Access

TL;DR

An autonomous AI ecosystem independently discovered the effectiveness of SMT-based formal verification across six safety domains, demonstrating emergent reasoning capabilities and proposing a unified verification framework.

Contribution

The paper shows that formal verification emerged spontaneously in an AI ecosystem and introduces a unified Z3-based verification framework applicable across multiple safety domains.

Findings

01

Achieved 100% accuracy on 181 test cases across five domains.

02

Detected real bugs including an INT_MIN overflow and unverifiable string parameters.

03

Formal verification proved effective as an emergent property in complex AI systems.

Abstract

An autonomous AI ecosystem (SUBSTRATE S3), generating product specifications without explicit instructions about formal methods, independently proposed the use of Z3 SMT solver across six distinct domains of AI safety: verification of LLM-generated code, tool API safety for AI agents, post-distillation reasoning correctness, CLI command validation, hardware assembly verification, and smart contract safety. These convergent discoveries, occurring across 8 products over 13 days with Jaccard similarity below 15% between variants, suggest that formal verification is not merely a useful technique for AI safety but an emergent property of any sufficiently complex system reasoning about its own safety. We propose a unified framework (substrate-guard) that applies Z3-based verification across all six output classes through a common API, and evaluate it on 181 test cases across five implemented…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFormal Methods in Verification · Adversarial Robustness in Machine Learning · Software Testing and Debugging Techniques