From Confusion to Clarity: ProtoScore -- A Framework for Evaluating Prototype-Based XAI
Helena Monke, Benjamin Sae-Chew, Benjamin Fresz, Marco F. Huber

TL;DR
ProtoScore is a comprehensive framework designed to evaluate prototype-based explainable AI methods, especially for time series data, enabling fair comparison and aiding practitioners in selecting suitable explanation techniques.
Contribution
The paper introduces ProtoScore, a standardized benchmarking framework for prototype-based XAI methods, addressing the lack of objective evaluation tools in the field.
Findings
Provides a systematic way to compare prototype-based XAI methods
Facilitates fair evaluation across different data types, especially time series
Helps practitioners choose appropriate explanation methods
Abstract
The complexity and opacity of neural networks (NNs) pose significant challenges, particularly in high-stakes fields such as healthcare, finance, and law, where understanding decision-making processes is crucial. To address these issues, the field of explainable artificial intelligence (XAI) has developed various methods aimed at clarifying AI decision-making, thereby facilitating appropriate trust and validating the fairness of outcomes. Among these methods, prototype-based explanations offer a promising approach that uses representative examples to elucidate model behavior. However, a critical gap exists regarding standardized benchmarks to objectively compare prototype-based XAI methods, especially in the context of time series data. This lack of reliable benchmarks results in subjective evaluations, hindering progress in the field. We aim to establish a robust framework, ProtoScore,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Artificial Intelligence in Healthcare and Education · Adversarial Robustness in Machine Learning
