Actionable Interpretability Must Be Defined in Terms of Symmetries

Pietro Barbiero; Mateo Espinosa Zarlenga; Francesco Giannini; Alberto Termine; Filippo Bonchi; Mateja Jamnik; Giuseppe Marra

arXiv:2601.12913·cs.AI·January 30, 2026

Actionable Interpretability Must Be Defined in Terms of Symmetries

Pietro Barbiero, Mateo Espinosa Zarlenga, Francesco Giannini, Alberto Termine, Filippo Bonchi, Mateja Jamnik, Giuseppe Marra

PDF

Open Access

TL;DR

This paper proposes that interpretability in AI should be defined through symmetries that enable formal testing, model design, and compliance verification, offering a unified probabilistic framework.

Contribution

It introduces a symmetry-based framework for defining and testing interpretability, unifying various interpretability concepts under a probabilistic model.

Findings

01

Identifies four key symmetries for interpretability: inference equivariance, information invariance, concept-closure invariance, structural invariance.

02

Formalizes interpretable models as a subclass of probabilistic models.

03

Provides a framework for verifying interpretability compliance with safety standards.

Abstract

This paper argues that interpretability research in Artificial Intelligence (AI) is fundamentally ill-posed as existing definitions of interpretability fail to describe how interpretability can be formally tested or designed for. We posit that actionable definitions of interpretability must be formulated in terms of *symmetries* that inform model design and lead to testable conditions. Under a probabilistic view, we hypothesise that four symmetries (inference equivariance, information invariance, concept-closure invariance, and structural invariance) suffice to (i) formalise interpretable models as a subclass of probabilistic models, (ii) yield a unified formulation of interpretable inference (e.g., alignment, interventions, and counterfactuals) as a form of Bayesian inversion, and (iii) provide a formal framework to verify compliance with safety standards and regulations.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Bayesian Modeling and Causal Inference · Embodied and Extended Cognition