Multi-source Stable Variable Importance Measure via Adversarial Machine Learning
Zitao Wang, Nian Si, Zijian Guo, Molei Liu

TL;DR
This paper introduces MIMAL, a new adversarial learning framework for stable variable importance measurement across multiple data sources with heterogeneous distributions, applicable to various machine learning models.
Contribution
We propose MIMAL, a novel adversarial approach for stable multi-source variable importance estimation, with theoretical guarantees and practical validation in real-world data.
Findings
MIMAL provides stable importance measures across diverse data sources.
The estimator is asymptotically normal under general conditions.
Numerical studies demonstrate the effectiveness of MIMAL in various scenarios.
Abstract
The quantification and inference of predictive importance for exposure covariates have recently gained significant attention in the context of interpretable machine learning. Contemporary scientific investigations often involve data originating from multiple sources with distributional heterogeneity. It is imperative to introduce a new notation of the variable importance measure that is stable across diverse environments. In this paper, we introduce MIMAL (Multi-source Importance Measure via Adversarial Learning), a novel statistical framework designed to quantify the importance of exposure variables by maximizing the worst-case predictive reward across source mixtures. The proposed framework is adaptable to a broad spectrum of machine learning methodologies for both confounding adjustment and exposure effect characterization. We establish the asymptotic normality of the data-dependent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications
