Partial Information Decomposition for Data Interpretability and Feature Selection
Charles Westphal, Stephen Hailes, Mirco Musolesi

TL;DR
This paper presents Partial Information Decomposition of Features (PIDF), a new method for data interpretability and feature selection that analyzes shared, synergistic, and redundant information among features.
Contribution
The paper introduces PIDF, a novel framework that simultaneously assesses feature importance, synergy, and redundancy, enhancing interpretability and selection processes.
Findings
PIDF effectively identifies feature contributions in synthetic data.
PIDF reveals complex feature interactions in genetics and neuroscience case studies.
Demonstrates improved interpretability over traditional importance measures.
Abstract
In this paper, we introduce Partial Information Decomposition of Features (PIDF), a new paradigm for simultaneous data interpretability and feature selection. Contrary to traditional methods that assign a single importance value, our approach is based on three metrics per feature: the mutual information shared with the target variable, the feature's contribution to synergistic information, and the amount of this information that is redundant. In particular, we develop a novel procedure based on these three metrics, which reveals not only how features are correlated with the target but also the additional and overlapping information provided by considering them in combination with other features. We extensively evaluate PIDF using both synthetic and real-world data, demonstrating its potential applications and effectiveness, by considering case studies from genetics and neuroscience.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
