Trustworthy AI Suffers from Invariance Conflicts and Causality is The Solution

Ruta Binkyte; Ivaxi Sheth; Zhijing Jin; Mohammad Havaei; Bernhard Sch\"olkopf; Mario Fritz

arXiv:2605.02640·cs.AI·May 5, 2026

Trustworthy AI Suffers from Invariance Conflicts and Causality is The Solution

Ruta Binkyte, Ivaxi Sheth, Zhijing Jin, Mohammad Havaei, Bernhard Sch\"olkopf, Mario Fritz

PDF

TL;DR

This paper argues that causality is essential for understanding and resolving trade-offs among fairness, robustness, privacy, and explainability in trustworthy AI, applicable to both classical models and foundation models.

Contribution

It introduces a causality-based framework to interpret and address invariance conflicts in trustworthy AI objectives, offering a unifying perspective.

Findings

01

Causality helps understand trade-offs as invariance conflicts.

02

Selective invariance can soften or resolve trade-offs.

03

Causal assumptions are relevant in large-scale AI systems.

Abstract

As artificial intelligence (AI), including machine learning (ML) models and foundation models (FMs), is increasingly deployed in high-stakes domains, ensuring their trustworthiness has become a central challenge. However, the core trustworthy AI objectives, such as fairness, robustness, privacy, and explainability, are hard to achieve simultaneously, especially while preserving utility. This position paper argues that causality is necessary to understand and balance trade-offs in performance and multiple objectives of trustworthy AI. We ground our arguments in re-interpreting trustworthy AI trade-offs as incompatible invariance requirements under different changes to the data-generating process. We then illustrate that causality provides a unifying framework for understanding how trade-offs in trustworthy AI arise, and how they can be softened or resolved through selective invariance.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.