Causal Explanations from the Geometric Properties of ReLU Neural Networks

Hector Woods; Philippa Ryan; Rob Alexander

arXiv:2605.10396·cs.LG·May 12, 2026

Causal Explanations from the Geometric Properties of ReLU Neural Networks

Hector Woods, Philippa Ryan, Rob Alexander

PDF

TL;DR

This paper introduces a geometric approach to generate causal explanations for ReLU neural networks by analyzing their piecewise linear regions, providing accurate insights into their decision-making processes.

Contribution

It presents a novel method that extracts causal explanations directly from the geometric properties of ReLU networks, avoiding model distillation.

Findings

01

The geometric representation aligns with the network's decision process.

02

Causal explanations derived from geometry are accurate reflections of the network.

03

The method enables understanding of neural network behavior without performance degradation.

Abstract

Neural networks have proved an effective means of learning control policies for autonomous systems, but these learned policies are difficult to understand due to the black-box nature of neural networks. This lack of interpretability makes safety assurance for such autonomous systems challenging. The fields of eXplainable Artificial Intelligence (XAI) and eXplainable Reinforcement Learning (XRL) aim to interpret the decision making processes of neural networks and autonomous agents, respectively. In particular, work on causal explanations aims to provide "why" and "why not" explanations for why a model made a given decision. However, most of the work on explainability to date utilises a distilled version of the original model. While this distilled policy is interpretable, it necessarily degrades in performance significantly when compared to the original model, and is not guaranteed to be…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.