Layerwise Knowledge Extraction from Deep Convolutional Networks
Simon Odense, Artur d'Avila Garcez

TL;DR
This paper introduces a layerwise rule extraction method for deep networks, revealing that while internal features are hard to explain, output layers are highly interpretable with simple, accurate rules.
Contribution
Proposes a novel M-of-N rule extraction technique that balances complexity and accuracy, demonstrating its effectiveness on deep networks and highlighting limitations in explaining internal layers.
Findings
Internal layers often lack simple, accurate rules
Output layers can be explained with very compact rules
Rule extraction is useful for explaining network modules
Abstract
Knowledge extraction is used to convert neural networks into symbolic descriptions with the objective of producing more comprehensible learning models. The central challenge is to find an explanation which is more comprehensible than the original model while still representing that model faithfully. The distributed nature of deep networks has led many to believe that the hidden features of a neural network cannot be explained by logical descriptions simple enough to be comprehensible. In this paper, we propose a novel layerwise knowledge extraction method using M-of-N rules which seeks to obtain the best trade-off between the complexity and accuracy of rules describing the hidden features of a deep network. We show empirically that this approach produces rules close to an optimal complexity-error tradeoff. We apply this method to a variety of deep networks and find that in the internal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Machine Learning and Data Classification · Adversarial Robustness in Machine Learning
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Softmax
