DISCOVER: Making Vision Networks Interpretable via Competition and Dissection
Konstantinos P. Panousis, Sotirios Chatzis

TL;DR
This paper introduces DISCOVER, a framework that enhances interpretability of vision networks by leveraging stochastic local competition and multimodal models to generate textual descriptions of neuron functions, improving transparency without sacrificing accuracy.
Contribution
It presents a novel post-hoc interpretability method that uses sparse neuron activation patterns to generate human-understandable descriptions of individual neuron functions in vision networks.
Findings
High activation sparsity (~4%) achieved.
Improved or maintained classification performance.
Facilitates direct investigation of network decision processes.
Abstract
Modern deep networks are highly complex and their inferential outcome very hard to interpret. This is a serious obstacle to their transparent deployment in safety-critical or bias-aware applications. This work contributes to post-hoc interpretability, and specifically Network Dissection. Our goal is to present a framework that makes it easier to discover the individual functionality of each neuron in a network trained on a vision task; discovery is performed in terms of textual description generation. To achieve this objective, we leverage: (i) recent advances in multimodal vision-text models and (ii) network layers founded upon the novel concept of stochastic local competition between linear units. In this setting, only a small subset of layer neurons are activated for a given input, leading to extremely high activation sparsity (as low as only ). Crucially, our proposed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Cell Image Analysis Techniques · Machine Learning in Materials Science
