ERIC: Extracting Relations Inferred from Convolutions
Joe Townsend, Theodoros Kasioumis, Hiroya Inakoshi

TL;DR
This paper introduces ERIC, a method to approximate CNN kernel behavior with logic programs, enabling better interpretability and analysis of neural network features and relationships.
Contribution
ERIC is the first approach to extract logic programs from CNN kernels across multiple layers, linking neural behavior with symbolic logic for interpretability.
Findings
Logic programs approximate CNN accuracy with some information loss.
Extracted rules identify key kernels and their relationships.
Kernels react strongly to image sets, dividing classes into sub-classes.
Abstract
Our main contribution is to show that the behaviour of kernels across multiple layers of a convolutional neural network can be approximated using a logic program. The extracted logic programs yield accuracies that correlate with those of the original model, though with some information loss in particular as approximations of multiple layers are chained together or as lower layers are quantised. We also show that an extracted program can be used as a framework for further understanding the behaviour of CNNs. Specifically, it can be used to identify key kernels worthy of deeper inspection and also identify relationships with other kernels in the form of the logical rules. Finally, we make a preliminary, qualitative assessment of rules we extract from the last convolutional layer and show that kernels identified are symbolic in that they react strongly to sets of similar images that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
