Explainability in AI Based Applications: A Framework for Comparing Different Techniques
Arne Grobrugge, Nidhi Mishra, Johannes Jakubik, Gerhard Satzger

TL;DR
This paper introduces a practical framework and a novel metric for comparing the agreement of different explainability techniques in AI, aiding better selection and integration in business applications.
Contribution
It proposes a new method to assess the agreement among explainability techniques and provides a comprehensive comparative analysis on deep learning models.
Findings
A novel metric for explainability agreement measurement
Comparison of six explainability techniques on Vision Transformer models
Framework facilitates better selection of explainability methods in practice
Abstract
The integration of artificial intelligence into business processes has significantly enhanced decision-making capabilities across various industries such as finance, healthcare, and retail. However, explaining the decisions made by these AI systems poses a significant challenge due to the opaque nature of recent deep learning models, which typically function as black boxes. To address this opacity, a multitude of explainability techniques have emerged. However, in practical business applications, the challenge lies in selecting an appropriate explainability method that balances comprehensibility with accuracy. This paper addresses the practical need of understanding differences in the output of explainability techniques by proposing a novel method for the assessment of the agreement of different explainability techniques. Based on our proposed methods, we provide a comprehensive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsLinear Layer · Label Smoothing · Byte Pair Encoding · Multi-Head Attention · Softmax · Adam · Dropout · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Transformer
