Relational Local Explanations

Vadim Borisov; Gjergji Kasneci

arXiv:2212.12374·cs.LG·February 14, 2023

Relational Local Explanations

Vadim Borisov, Gjergji Kasneci

PDF

Open Access

TL;DR

This paper introduces a novel, model-agnostic, permutation-based feature attribution method that considers inter-variable relationships, providing deeper insights into model predictions for structured data like images and text.

Contribution

It proposes a new relational analysis-based attribution approach that captures latent feature relationships, improving interpretability over traditional independent attribution methods.

Findings

01

Outperforms state-of-the-art attribution techniques on image and text data.

02

Provides broader insights into model decision-making processes.

03

Validated through extensive experimental evaluations.

Abstract

The majority of existing post-hoc explanation approaches for machine learning models produce independent, per-variable feature attribution scores, ignoring a critical inherent characteristics of homogeneously structured data, such as visual or text data: there exist latent inter-variable relationships between features. In response, we develop a novel model-agnostic and permutation-based feature attribution approach based on the relational analysis between input variables. As a result, we are able to gain a broader insight into the predictions and decisions of machine learning models. Experimental evaluations of our framework in comparison with state-of-the-art attribution techniques on various setups involving both image and text data modalities demonstrate the effectiveness and validity of our method.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Machine Learning in Healthcare · Radiomics and Machine Learning in Medical Imaging