Causality-Based Feature Importance Quantifying Methods: PN-FI, PS-FI and   PNS-FI

Shuxian Du; Yaxiu Sun; Changyi Du

arXiv:2308.14474·cs.AI·September 19, 2023

Causality-Based Feature Importance Quantifying Methods: PN-FI, PS-FI and PNS-FI

Shuxian Du, Yaxiu Sun, Changyi Du

PDF

Open Access

TL;DR

This paper introduces three causality-based feature importance measures—PN-FI, PS-FI, and PNS-FI—using probabilities of necessity, sufficiency, and their conjunction, validated through experiments on image features.

Contribution

It proposes novel causality-inspired feature importance metrics, integrating probabilistic causality concepts into feature selection for image tasks.

Findings

01

Feature importance bounds are tight intervals.

02

Dog eyes feature has the highest importance.

03

PNS and PN bounds are tighter than PS bounds.

Abstract

In the current ML field models are getting larger and more complex, and data used for model training are also getting larger in quantity and higher in dimensions. Therefore, in order to train better models, and save training time and computational resources, a good Feature Selection (FS) method in the preprocessing stage is necessary. Feature importance (FI) is of great importance since it is the basis of feature selection. Therefore, this paper creatively introduces the calculation of PN (the probability of Necessity), PN (the probability of Sufficiency), and PNS (the probability of Necessity and Sufficiency) of Causality into quantifying feature importance and creates 3 new FI measuring methods, PN-FI, which means how much importance a feature has in image recognition tasks, PS-FI that means how much importance a feature has in image generating tasks, and PNS-FI which measures both.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Text and Document Classification Technologies

MethodsFeature Selection