The CAISAR Platform: Extending the Reach of Machine Learning Specification and Verification
Michele Alberti (LSL), Fran\c{c}ois Bobot (LSL), Julien Girard-Satabin (LSL), Alban Grastien (LSL), Aymeric Varasse, Zakaria Chihani (LSL)

TL;DR
The CAISAR platform provides a unified, extensible environment for specifying and verifying complex properties of machine learning models, supporting multiple model types and leveraging existing provers through automated translation.
Contribution
It introduces a novel specification language and an open-source platform that enable modeling and verification of complex ML properties across various model types.
Findings
Supports complex properties involving multiple models
Automates translation to existing provers
Demonstrates effectiveness on real-world use cases
Abstract
The formal specification and verification of machine learning programs saw remarkable progress in less than a decade, leading to a profusion of tools. However, diversity may lead to fragmentation, resulting in tools that are difficult to compare, except for very specific benchmarks. Furthermore, this progress is heavily geared towards the specification and verification of a certain class of property, that is, local robustness properties. But while provers are becoming more and more efficient at solving local robustness properties, even slightly more complex properties, involving multiple neural networks for example, cannot be expressed in the input languages of winners of the International Competition of Verification of Neural Networks VNN-Comp. In this tool paper, we present CAISAR, an open-source platform dedicated to machine learning specification and verification. We present its…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsScientific Computing and Data Management · Advanced Data Processing Techniques
