Improving the Interpretability of fMRI Decoding using Deep Neural Networks and Adversarial Robustness
Patrick McClure, Dustin Moraczewski, Ka Chun Lam, Adam Thomas,, Francisco Pereira

TL;DR
This paper enhances the interpretability of fMRI decoding by developing a robust adversarial training method for deep neural networks, improving saliency map quality and evaluation in neuroimaging analysis.
Contribution
It introduces a novel adversarial training approach to improve DNN robustness and interpretability in fMRI decoding, along with new evaluation procedures for saliency maps.
Findings
Saliency maps vary widely in interpretability across methods.
DNN saliency maps outperform linear models in interpretability.
Adversarial training improves the quality of saliency maps.
Abstract
Deep neural networks (DNNs) are being increasingly used to make predictions from functional magnetic resonance imaging (fMRI) data. However, they are widely seen as uninterpretable "black boxes", as it can be difficult to discover what input information is used by the DNN in the process, something important in both cognitive neuroscience and clinical applications. A saliency map is a common approach for producing interpretable visualizations of the relative importance of input features for a prediction. However, methods for creating maps often fail due to DNNs being sensitive to input noise, or by focusing too much on the input and too little on the model. It is also challenging to evaluate how well saliency maps correspond to the truly relevant input information, as ground truth is not always available. In this paper, we review a variety of methods for producing gradient-based saliency…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Functional Brain Connectivity Studies · Adversarial Robustness in Machine Learning
MethodsInterpretability
