Towards Ultrahigh Dimensional Feature Selection for Big Data
Mingkui Tan, Ivor W. Tsang, Li Wang

TL;DR
This paper introduces a novel adaptive feature scaling method for ultrahigh-dimensional feature selection in Big Data, reformulating the problem as a convex SIP and employing a feature generating paradigm for efficiency and effectiveness.
Contribution
It proposes a new feature generating paradigm with a primal MKL approach, enabling scalable, nonlinear, and group-based feature selection with guaranteed convergence and lower bias.
Findings
Outperforms state-of-the-art methods in accuracy and efficiency.
Successfully handles datasets with tens of millions of data points and trillions of features.
Demonstrates effectiveness in both synthetic and real-world scenarios.
Abstract
In this paper, we present a new adaptive feature scaling scheme for ultrahigh-dimensional feature selection on Big Data. To solve this problem effectively, we first reformulate it as a convex semi-infinite programming (SIP) problem and then propose an efficient \emph{feature generating paradigm}. In contrast with traditional gradient-based approaches that conduct optimization on all input features, the proposed method iteratively activates a group of features and solves a sequence of multiple kernel learning (MKL) subproblems of much reduced scale. To further speed up the training, we propose to solve the MKL subproblems in their primal forms through a modified accelerated proximal gradient approach. Due to such an optimization scheme, some efficient cache techniques are also developed. The feature generating paradigm can guarantee that the solution converges globally under mild…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition · Sparse and Compressive Sensing Techniques · Machine Learning and ELM
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
