Pathwise Explanation of ReLU Neural Networks

Seongwoo Lim; Won Jo; Joohyung Lee; Jaesik Choi

arXiv:2506.18037·cs.LG·June 24, 2025

Pathwise Explanation of ReLU Neural Networks

Seongwoo Lim, Won Jo, Joohyung Lee, Jaesik Choi

PDF

TL;DR

This paper introduces a pathwise explanation method for ReLU neural networks that enhances interpretability by analyzing decision paths, offering flexible and detailed explanations that outperform existing methods.

Contribution

It presents a novel pathwise explanation approach that considers subsets of hidden units, improving transparency and interpretability of ReLU networks.

Findings

01

Outperforms existing explanation methods quantitatively.

02

Provides more consistent and detailed explanations.

03

Flexible adjustment of explanation scope within inputs.

Abstract

Neural networks have demonstrated a wide range of successes, but their ``black box" nature raises concerns about transparency and reliability. Previous research on ReLU networks has sought to unwrap these networks into linear models based on activation states of all hidden units. In this paper, we introduce a novel approach that considers subsets of the hidden units involved in the decision making path. This pathwise explanation provides a clearer and more consistent understanding of the relationship between the input and the decision-making process. Our method also offers flexibility in adjusting the range of explanations within the input, i.e., from an overall attribution input to particular components within the input. Furthermore, it allows for the decomposition of explanations for a given input for more detailed explanations. Experiments demonstrate that our method outperforms…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.