Decorrelated feature importance from local sample weighting
Benedikt Fr\"ohlich, Alison Durst, Merle Behr

TL;DR
This paper introduces losaw, a local sample weighting method that decorrelates features to improve feature importance accuracy and model performance in correlated data settings, applicable to various ML models.
Contribution
The paper proposes a novel local sample weighting technique, losaw, to enhance feature importance estimation and model accuracy in the presence of feature correlation.
Findings
losaw improves feature importance scores consistently.
losaw enhances out-of-distribution prediction accuracy.
losaw maintains in-distribution accuracy.
Abstract
Feature importance (FI) statistics provide a prominent and valuable method of insight into the decision process of machine learning (ML) models, but their effectiveness has well-known limitations when correlation is present among the features in the training data. In this case, the FI often tends to be distributed among all features which are in correlation with the response-generating signal features. Even worse, if multiple signal features are in strong correlation with a noise feature, while being only modestly correlated with one another, this can result in a noise feature having a distinctly larger FI score than any signal feature. Here we propose local sample weighting (losaw) which can flexibly be integrated into many ML algorithms to improve FI scores in the presence of feature correlation in the training data. Our approach is motivated from inverse probability weighting in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition
