HINT: Hierarchical Neuron Concept Explainer
Andong Wang, Wei-Ning Lee, Xiaojuan Qi

TL;DR
HINT is a scalable method for interpreting deep neural networks by associating neurons with hierarchical concepts, revealing how neurons encode both concrete and abstract concepts and their relationships.
Contribution
The paper introduces HINT, a novel approach that models hierarchical concept associations in neurons, enabling systematic analysis of concept embedding in deep networks.
Findings
Identifies collaborative neurons responsible for specific concepts.
Reveals multimodal neurons for different concepts.
Demonstrates applicability in saliency detection and adversarial explanation.
Abstract
To interpret deep networks, one main approach is to associate neurons with human-understandable concepts. However, existing methods often ignore the inherent relationships of different concepts (e.g., dog and cat both belong to animals), and thus lose the chance to explain neurons responsible for higher-level concepts (e.g., animal). In this paper, we study hierarchical concepts inspired by the hierarchical cognition process of human beings. To this end, we propose HIerarchical Neuron concepT explainer (HINT) to effectively build bidirectional associations between neurons and hierarchical concepts in a low-cost and scalable manner. HINT enables us to systematically and quantitatively study whether and how the implicit hierarchical relationships of concepts are embedded into neurons, such as identifying collaborative neurons responsible to one concept and multimodal neurons for different…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Machine Learning in Materials Science
