TL;DR
This paper introduces a synthetic benchmark dataset for evaluating neural network attribution methods in geoscience regression problems, enabling objective assessment of XAI techniques with known ground truth.
Contribution
It presents a novel framework for creating attribution benchmark datasets in geosciences, facilitating more accurate evaluation of XAI methods with known ground truth.
Findings
Generated a large synthetic dataset for geoscience regression problems.
Compared various XAI methods against ground truth to assess performance.
Highlighted the importance of objective benchmarks for model trust and scientific discovery.
Abstract
Despite the increasingly successful application of neural networks to many problems in the geosciences, their complex and nonlinear structure makes the interpretation of their predictions difficult, which limits model trust and does not allow scientists to gain physical insights about the problem at hand. Many different methods have been introduced in the emerging field of eXplainable Artificial Intelligence (XAI), which aim at attributing the network s prediction to specific features in the input domain. XAI methods are usually assessed by using benchmark datasets (like MNIST or ImageNet for image classification). However, an objective, theoretically derived ground truth for the attribution is lacking for most of these datasets, making the assessment of XAI in many cases subjective. Also, benchmark datasets specifically designed for problems in geosciences are rare. Here, we provide a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
