Top-$k$ Regularization for Supervised Feature Selection
Xinxing Wu, Qiang Cheng

TL;DR
This paper proposes a novel top-$k$ regularization method for supervised feature selection that enhances the selection of informative features and models complex nonlinear relationships, with proven theoretical bounds and extensive empirical validation.
Contribution
Introduces a new top-$k$ regularization approach that improves feature selection by balancing representativeness and inter-feature correlations, with theoretical guarantees and practical effectiveness.
Findings
Effective in selecting informative features across datasets
Provides a uniform approximation error bound for high-dimensional functions
Demonstrates stability and superiority over existing methods
Abstract
Feature selection identifies subsets of informative features and reduces dimensions in the original feature space, helping provide insights into data generation or a variety of domain problems. Existing methods mainly depend on feature scoring functions or sparse regularizations; nonetheless, they have limited ability to reconcile the representativeness and inter-correlations of features. In this paper, we introduce a novel, simple yet effective regularization approach, named top- regularization, to supervised feature selection in regression and classification tasks. Structurally, the top- regularization induces a sub-architecture on the architecture of a learning model to boost its ability to select the most informative features and model complex nonlinear relationships simultaneously. Theoretically, we derive and mathematically prove a uniform approximation error bound for using…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Machine Learning and Algorithms · Face and Expression Recognition
MethodsFeature Selection
