Multi-source Stable Variable Importance Measure via Adversarial Machine   Learning

Zitao Wang; Nian Si; Zijian Guo; Molei Liu

arXiv:2409.07380·stat.ME·December 31, 2024

Multi-source Stable Variable Importance Measure via Adversarial Machine Learning

Zitao Wang, Nian Si, Zijian Guo, Molei Liu

PDF

Open Access

TL;DR

This paper introduces MIMAL, a new adversarial learning framework for stable variable importance measurement across multiple data sources with heterogeneous distributions, applicable to various machine learning models.

Contribution

We propose MIMAL, a novel adversarial approach for stable multi-source variable importance estimation, with theoretical guarantees and practical validation in real-world data.

Findings

01

MIMAL provides stable importance measures across diverse data sources.

02

The estimator is asymptotically normal under general conditions.

03

Numerical studies demonstrate the effectiveness of MIMAL in various scenarios.

Abstract

The quantification and inference of predictive importance for exposure covariates have recently gained significant attention in the context of interpretable machine learning. Contemporary scientific investigations often involve data originating from multiple sources with distributional heterogeneity. It is imperative to introduce a new notation of the variable importance measure that is stable across diverse environments. In this paper, we introduce MIMAL (Multi-source Importance Measure via Adversarial Learning), a novel statistical framework designed to quantify the importance of exposure variables by maximizing the worst-case predictive reward across source mixtures. The proposed framework is adaptable to a broad spectrum of machine learning methodologies for both confounding adjustment and exposure effect characterization. We establish the asymptotic normality of the data-dependent…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications