Increasing Interpretability of Neural Networks By Approximating Human Visual Saliency
Aidan Boyd, Mohamed Trabelsi, Huseyin Uzunalioglu, Dan Kushnir

TL;DR
This paper proposes a method combining saliency guidance and active learning to improve neural network interpretability while significantly reducing the need for costly human annotations.
Contribution
It introduces a novel approach that integrates saliency information with active learning to enhance interpretability and performance efficiently.
Findings
Models with saliency guidance show up to 30% interpretability improvement.
Active learning reduces human annotation effort by 80%.
The approach is effective across multiple datasets and criteria.
Abstract
Understanding specifically where a model focuses on within an image is critical for human interpretability of the decision-making process. Deep learning-based solutions are prone to learning coincidental correlations in training datasets, causing over-fitting and reducing the explainability. Recent advances have shown that guiding models to human-defined regions of saliency within individual images significantly increases performance and interpretability. Human-guided models also exhibit greater generalization capabilities, as coincidental dataset features are avoided. Results show that models trained with saliency incorporation display an increase in interpretability of up to 30% over models trained without saliency information. The collection of this saliency information, however, can be costly, laborious and in some cases infeasible. To address this limitation, we propose a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Visual Attention and Saliency Detection · Neural Networks and Applications
