A redescription mining framework for post-hoc explaining and relating deep learning models
Matej Mihel\v{c}i\'c, Ivan Grubi\v{s}i\'c, Miha Keber

TL;DR
This paper introduces a novel, architecture-independent framework for post-hoc explanation of deep learning models using redescriptions, enabling detailed cohort analysis and interpretability of neuron activations across various models and complex labels.
Contribution
The framework is the first to use redescriptions for explaining and relating deep learning models, supporting complex labels and offering both pedagogical and decompositional rule extraction methods.
Findings
Enables cohort analysis of DLMs through statistically significant redescriptions.
Supports coupling neurons with target labels or descriptive attributes.
Works independently of neural network architecture.
Abstract
Deep learning models (DLMs) achieve increasingly high performance both on structured and unstructured data. They significantly extended applicability of machine learning to various domains. Their success in making predictions, detecting patterns and generating new data made significant impact on science and industry. Despite these accomplishments, DLMs are difficult to explain because of their enormous size. In this work, we propose a novel framework for post-hoc explaining and relating DLMs using redescriptions. The framework allows cohort analysis of arbitrary DLMs by identifying statistically significant redescriptions of neuron activations. It allows coupling neurons to a set of target labels or sets of descriptive attributes, relating layers within a single DLM or associating different DLMs. The proposed framework is independent of the artificial neural network architecture and can…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Topic Modeling · Machine Learning in Healthcare
MethodsSparse Evolutionary Training
