The CAISAR Platform: Extending the Reach of Machine Learning Specification and Verification

Michele Alberti (LSL); Fran\c{c}ois Bobot (LSL); Julien Girard-Satabin (LSL); Alban Grastien (LSL); Aymeric Varasse; Zakaria Chihani (LSL)

arXiv:2506.12084·cs.SE·January 21, 2026

The CAISAR Platform: Extending the Reach of Machine Learning Specification and Verification

Michele Alberti (LSL), Fran\c{c}ois Bobot (LSL), Julien Girard-Satabin (LSL), Alban Grastien (LSL), Aymeric Varasse, Zakaria Chihani (LSL)

PDF

Open Access 1 Video

TL;DR

The CAISAR platform provides a unified, extensible environment for specifying and verifying complex properties of machine learning models, supporting multiple model types and leveraging existing provers through automated translation.

Contribution

It introduces a novel specification language and an open-source platform that enable modeling and verification of complex ML properties across various model types.

Findings

01

Supports complex properties involving multiple models

02

Automates translation to existing provers

03

Demonstrates effectiveness on real-world use cases

Abstract

The formal specification and verification of machine learning programs saw remarkable progress in less than a decade, leading to a profusion of tools. However, diversity may lead to fragmentation, resulting in tools that are difficult to compare, except for very specific benchmarks. Furthermore, this progress is heavily geared towards the specification and verification of a certain class of property, that is, local robustness properties. But while provers are becoming more and more efficient at solving local robustness properties, even slightly more complex properties, involving multiple neural networks for example, cannot be expressed in the input languages of winners of the International Competition of Verification of Neural Networks VNN-Comp. In this tool paper, we present CAISAR, an open-source platform dedicated to machine learning specification and verification. We present its…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

The CAISAR Platform: Extending the Reach of Machine Learning Specification and Verification· underline

Taxonomy

TopicsScientific Computing and Data Management · Advanced Data Processing Techniques