Attention-like feature explanation for tabular data
Andrei V. Konstantinov, Lev V. Utkin

TL;DR
AFEX is a novel explanation system for tabular data that uses attention-like mechanisms and neural subnetworks to produce interpretable feature shape functions and identify feature interactions, trained end-to-end for efficiency.
Contribution
It introduces AFEX, a new neural network-based method for local and global explanations of tabular data models, with a unique attention-like mechanism and interaction detection.
Findings
AFEX effectively explains black-box models on synthetic and real datasets.
It identifies feature interactions through pairwise shape function multiplications.
AFEX requires only one training pass on the dataset, enabling efficient explanations.
Abstract
A new method for local and global explanation of the machine learning black-box model predictions by tabular data is proposed. It is implemented as a system called AFEX (Attention-like Feature EXplanation) and consisting of two main parts. The first part is a set of the one-feature neural subnetworks which aim to get a specific representation for every feature in the form of a basis of shape functions. The subnetworks use shortcut connections with trainable parameters to improve the network performance. The second part of AFEX produces shape functions of features as the weighted sum of the basis shape functions where weights are computed by using an attention-like mechanism. AFEX identifies pairwise interactions between features based on pairwise multiplications of shape functions corresponding to different features. A modification of AFEX with incorporating an additional surrogate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Machine Learning and Data Classification
