Locally Testing Model Detections for Semantic Global Concepts
Franz Motzkus, Georgii Mikriukov, Christian Hellert, Ute Schmid

TL;DR
This paper introduces a framework called glCA that tests DNNs for specific semantic concepts at a local level, linking global concept encodings to individual input processing to improve model interpretability and safety.
Contribution
The paper presents a novel global-to-local concept attribution method that conditions local explanations on predefined semantic concepts in the model's latent space.
Findings
Major differences in local perception of global concepts were identified.
The approach enables detailed understanding of concept encoding within models.
Further research is needed to improve semantic concept encodings.
Abstract
Ensuring the quality of black-box Deep Neural Networks (DNNs) has become ever more significant, especially in safety-critical domains such as automated driving. While global concept encodings generally enable a user to test a model for a specific concept, linking global concept encodings to the local processing of single network inputs reveals their strengths and limitations. Our proposed framework global-to-local Concept Attribution (glCA) uses approaches from local (why a specific prediction originates) and global (how a model works generally) eXplainable Artificial Intelligence (xAI) to test DNNs for a predefined semantical concept locally. The approach allows for conditioning local, post-hoc explanations on predefined semantic concepts encoded as linear directions in the model's latent space. Pixel-exact scoring concerning the global concept usage assists the tester in further…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Topic Modeling · Natural Language Processing Techniques
