Saliency Methods are Encoders: Analysing Logical Relations Towards Interpretation
Leonid Schwenke, Martin Atzmueller

TL;DR
This paper introduces a logical reasoning-based testing framework for evaluating saliency methods, revealing how these methods encode class-relevant information and highlighting limitations of current evaluation practices.
Contribution
It proposes a novel evaluation approach using logical datasets and new metrics to analyze saliency methods' treatment of information, addressing biases in existing evaluation methods.
Findings
Saliency methods encode classification information in score orderings.
Current evaluation practices may be biased and insufficient.
Logical datasets reveal how methods handle complementary and redundant info.
Abstract
With their increase in performance, neural network architectures also become more complex, necessitating explainability. Therefore, many new and improved methods are currently emerging, which often generate so-called saliency maps in order to improve interpretability. Those methods are often evaluated by visual expectations, yet this typically leads towards a confirmation bias. Due to a lack of a general metric for explanation quality, non-accessible ground truth data about the model's reasoning and the large amount of involved assumptions, multiple works claim to find flaws in those methods. However, this often leads to unfair comparison metrics. Additionally, the complexity of most datasets (mostly images or text) is often so high, that approximating all possible explanations is not feasible. For those reasons, this paper introduces a test for saliency map evaluation: proposing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCognitive Science and Mapping · Semantic Web and Ontologies
