Structuring Relevant Feature Sets with Multiple Model Learning
Jun Wang, Alexandros Kalousis

TL;DR
This paper introduces a novel learning paradigm that uncovers underlying structures in relevant feature sets, distinguishing non-replaceable features from interchangeable ones, enhancing interpretability in high-dimensional data analysis.
Contribution
It proposes a new algorithm that learns multiple disjoint models with regularization to reveal feature structures, including non-replaceable and functionally similar feature sets.
Findings
Models exhibit disjointness and high predictive agreement.
Structured feature sets maintain performance comparable to full relevant features.
Approach is validated on various high-dimensional datasets.
Abstract
Feature selection is one of the most prominent learning tasks, especially in high-dimensional datasets in which the goal is to understand the mechanisms that underly the learning dataset. However most of them typically deliver just a flat set of relevant features and provide no further information on what kind of structures, e.g. feature groupings, might underly the set of relevant features. In this paper we propose a new learning paradigm in which our goal is to uncover the structures that underly the set of relevant features for a given learning problem. We uncover two types of features sets, non-replaceable features that contain important information about the target variable and cannot be replaced by other features, and functionally similar features sets that can be used interchangeably in learned models, given the presence of the non-replaceable features, with no change in the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Algorithms · Machine Learning and Data Classification
