Towards Interpretable ANNs: An Exact Transformation to Multi-Class Multivariate Decision Trees
Duy T. Nguyen, Kathryn E. Kasmarik, Hussein A. Abbass

TL;DR
This paper introduces two novel algorithms that transform neural networks with ReLU activations into interpretable multivariate decision trees, enabling rule extraction and better understanding of neural network decisions.
Contribution
The paper presents exact and extended algorithms for converting ANNs into multivariate decision trees, improving interpretability and fidelity of neural network models.
Findings
EC-DT preserves ANN decision boundaries accurately.
Extended C-Net produces compact, effective decision trees.
Both methods enable rule extraction with attribute combinations.
Abstract
On the one hand, artificial neural networks (ANNs) are commonly labelled as black-boxes, lacking interpretability; an issue that hinders human understanding of ANNs' behaviors. A need exists to generate a meaningful sequential logic of the ANN for interpreting a production process of a specific output. On the other hand, decision trees exhibit better interpretability and expressive power due to their representation language and the existence of efficient algorithms to transform the trees into rules. However, growing a decision tree based on the available data could produce larger than necessary trees or trees that do not generalise well. In this paper, we introduce two novel multivariate decision tree (MDT) algorithms for rule extraction from ANNs: an Exact-Convertible Decision Tree (EC-DT) and an Extended C-Net algorithm. They both transform a neural network with Rectified Linear Unit…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Neural Networks and Applications · Imbalanced Data Classification Techniques
MethodsInterpretability
