A Causal Argumentation Method for Explainability of Machine Learning Models

Henry Salgado; Meagan R. Kendall; Martine Ceberio

arXiv:2605.21758·cs.AI·May 22, 2026

A Causal Argumentation Method for Explainability of Machine Learning Models

Henry Salgado, Meagan R. Kendall, Martine Ceberio

PDF

TL;DR

This paper introduces a novel explainability method for machine learning models that combines causality detection with argumentation frameworks to clarify decision-making processes.

Contribution

It integrates causal discovery with bipolar argumentation frameworks to enhance interpretability of model predictions.

Findings

01

Effectively identifies causal relationships among features.

02

Provides explanations using argumentation semantics.

03

Outperforms standard post-hoc explainability methods on benchmarks.

Abstract

Explainable AI (XAI) methods identify which features are relevant to a model's predictions but often fail to clarify why certain decisions are made. In this work, we present a novel method that integrates causality with argument-based reasoning to explain why models may be making predictions. Our approach first identifies causal relationships among variables using causal discovery methods and then translates these into a Bipolar Argumentation Framework (BAF) to represent supportive and opposing interactions among features. By using semi-stable semantics, we find extensions of features that explain why certain outcomes may have been chosen. We demonstrate our method on two benchmark datasets and compare its results against standard post-hoc explainability approaches.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.