Causality-Based Feature Importance Quantifying Methods: PN-FI, PS-FI and PNS-FI
Shuxian Du, Yaxiu Sun, Changyi Du

TL;DR
This paper introduces three causality-based feature importance measures—PN-FI, PS-FI, and PNS-FI—using probabilities of necessity, sufficiency, and their conjunction, validated through experiments on image features.
Contribution
It proposes novel causality-inspired feature importance metrics, integrating probabilistic causality concepts into feature selection for image tasks.
Findings
Feature importance bounds are tight intervals.
Dog eyes feature has the highest importance.
PNS and PN bounds are tighter than PS bounds.
Abstract
In the current ML field models are getting larger and more complex, and data used for model training are also getting larger in quantity and higher in dimensions. Therefore, in order to train better models, and save training time and computational resources, a good Feature Selection (FS) method in the preprocessing stage is necessary. Feature importance (FI) is of great importance since it is the basis of feature selection. Therefore, this paper creatively introduces the calculation of PN (the probability of Necessity), PN (the probability of Sufficiency), and PNS (the probability of Necessity and Sufficiency) of Causality into quantifying feature importance and creates 3 new FI measuring methods, PN-FI, which means how much importance a feature has in image recognition tasks, PS-FI that means how much importance a feature has in image generating tasks, and PNS-FI which measures both.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Text and Document Classification Technologies
MethodsFeature Selection
