Rule induction for global explanation of trained models

Madhumita Sushil; Simon \v{S}uster; Walter Daelemans

arXiv:1808.09744·cs.CL·August 30, 2018

Rule induction for global explanation of trained models

Madhumita Sushil, Simon \v{S}uster, Walter Daelemans

PDF

1 Repo

TL;DR

This paper introduces a rule induction method that globally explains trained neural network predictions by capturing feature-class relations, improving interpretability and trust in automated systems.

Contribution

It proposes a novel technique to induce rule sets that explain neural network predictions by integrating feature importance and simplifying input space.

Findings

01

Achieved macro-averaged F-score of 0.80 on 20 newsgroups dataset

02

Captured feature-class relations effectively with rule sets

03

Enhanced interpretability of neural network predictions

Abstract

Understanding the behavior of a trained network and finding explanations for its outputs is important for improving the network's performance and generalization ability, and for ensuring trust in automated systems. Several approaches have previously been proposed to identify and visualize the most important features by analyzing a trained network. However, the relations between different features and classes are lost in most cases. We propose a technique to induce sets of if-then-else rules that capture these relations to globally explain the predictions of a network. We first calculate the importance of the features in the trained network. We then weigh the original inputs with these feature importance scores, simplify the transformed input space, and finally fit a rule induction model to explain the model predictions. We find that the output rule-sets can explain the predictions of a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

clips/interpret_with_rules
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.