Ultra-marginal Feature Importance: Learning from Data with Causal Guarantees
Joseph Janssen, Vincent Guan, Elina Robeva

TL;DR
This paper introduces ultra-marginal feature importance (UMFI), a new method that improves causal interpretability and computational efficiency over existing approaches by leveraging dependence removal techniques.
Contribution
The paper develops UMFI, enhancing theoretical properties, performance, and runtime of marginal contribution feature importance by integrating dependence removal techniques from AI fairness.
Findings
UMFI satisfies proposed axioms for causal and associative explanations.
UMFI outperforms MCI on real and simulated data, especially with correlated features.
UMFI reduces runtime from exponential to super-linear.
Abstract
Scientists frequently prioritize learning from data rather than training the best possible model; however, research in machine learning often prioritizes the latter. Marginal contribution feature importance (MCI) was developed to break this trend by providing a useful framework for quantifying the relationships in data. In this work, we aim to improve upon the theoretical properties, performance, and runtime of MCI by introducing ultra-marginal feature importance (UMFI), which uses dependence removal techniques from the AI fairness literature as its foundation. We first propose axioms for feature importance methods that seek to explain the causal and associative relationships in data, and we prove that UMFI satisfies these axioms under basic assumptions. We then show on real and simulated data that UMFI performs better than MCI, especially in the presence of correlated interactions and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Bayesian Modeling and Causal Inference
MethodsLinear Regression
