Feature Importance Estimation with Self-Attention Networks

Bla\v{z} \v{S}krlj; Sa\v{s}o D\v{z}eroski; Nada Lavra\v{c}; Matej; Petkovi\v{c}

arXiv:2002.04464·cs.LG·January 19, 2021·27 cites

Feature Importance Estimation with Self-Attention Networks

Bla\v{z} \v{S}krlj, Sa\v{s}o D\v{z}eroski, Nada Lavra\v{c}, Matej, Petkovi\v{c}

PDF

Open Access

TL;DR

This paper investigates using self-attention networks to estimate feature importance in tabular data, comparing their effectiveness with traditional methods and demonstrating their ability to identify relevant features and interactions.

Contribution

It introduces a novel SAN-based approach for feature importance estimation and provides the first scale-free comparison with established methods across multiple datasets.

Findings

01

SANs identify similar high-ranked features as traditional methods

02

SANs can detect larger feature interactions relevant for prediction

03

SANs sometimes outperform baselines in predictive accuracy

Abstract

Black-box neural network models are widely used in industry and science, yet are hard to understand and interpret. Recently, the attention mechanism was introduced, offering insights into the inner workings of neural language models. This paper explores the use of attention-based neural networks mechanism for estimating feature importance, as means for explaining the models learned from propositional (tabular) data. Feature importance estimates, assessed by the proposed Self-Attention Network (SAN) architecture, are compared with the established ReliefF, Mutual Information and Random Forest-based estimates, which are widely used in practice for model interpretation. For the first time we conduct scale-free comparisons of feature importance estimates across algorithms on ten real and synthetic data sets to study the similarities and differences of the resulting feature importance…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Topic Modeling · Machine Learning and Data Classification