Rule Extraction from Binary Neural Networks with Convolutional Rules for   Model Validation

Sophie Burkhardt; Jannis Brugger; Nicolas Wagner; Zahra Ahmadi,; Kristian Kersting; Stefan Kramer

arXiv:2012.08459·cs.LG·December 16, 2020

Rule Extraction from Binary Neural Networks with Convolutional Rules for Model Validation

Sophie Burkhardt, Jannis Brugger, Nicolas Wagner, Zahra Ahmadi,, Kristian Kersting, Stefan Kramer

PDF

TL;DR

This paper presents a method for extracting interpretable first-order convolutional rules from binary CNNs, enabling better understanding and validation of neural network decisions, especially for high-dimensional data like images.

Contribution

It introduces convolutional rules for rule extraction from binary neural networks, improving interpretability without sacrificing model fidelity.

Findings

01

Successfully models neural network functionality with logical rules

02

Produces visualizable and characteristic rules for high-dimensional data

03

Demonstrates effective rule extraction using stochastic local search

Abstract

Most deep neural networks are considered to be black boxes, meaning their output is hard to interpret. In contrast, logical expressions are considered to be more comprehensible since they use symbols that are semantically close to natural language instead of distributed representations. However, for high-dimensional input data such as images, the individual symbols, i.e. pixels, are not easily interpretable. We introduce the concept of first-order convolutional rules, which are logical rules that can be extracted using a convolutional neural network (CNN), and whose complexity depends on the size of the convolutional filter and not on the dimensionality of the input. Our approach is based on rule extraction from binary neural networks with stochastic local search. We show how to extract rules that are not necessarily short, but characteristic of the input, and easy to visualize. Our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.