Learning Non-Linear Feature Maps

Dimitrios Athanasakis; John Shawe-Taylor; Delmiro Fernandez-Reyes

arXiv:1311.5636·cs.LG·November 25, 2013·1 cites

Learning Non-Linear Feature Maps

Dimitrios Athanasakis, John Shawe-Taylor, Delmiro Fernandez-Reyes

PDF

Open Access

TL;DR

This paper introduces randSel, a scalable randomized feature selection method with probabilistic guarantees, demonstrating improved performance in identifying relevant features in high-dimensional data.

Contribution

The paper presents randSel, a novel randomized algorithm for non-linear feature selection with theoretical guarantees and superior empirical performance.

Findings

01

randSel scales well to high-dimensional datasets

02

It effectively identifies relevant features

03

Outperforms several competitive methods in experiments

Abstract

Feature selection plays a pivotal role in learning, particularly in areas were parsimonious features can provide insight into the underlying process, such as biology. Recent approaches for non-linear feature selection employing greedy optimisation of Centred Kernel Target Alignment(KTA), while exhibiting strong results in terms of generalisation accuracy and sparsity, can become computationally prohibitive for high-dimensional datasets. We propose randSel, a randomised feature selection algorithm, with attractive scaling properties. Our theoretical analysis of randSel provides strong probabilistic guarantees for the correct identification of relevant features. Experimental results on real and artificial data, show that the method successfully identifies effective features, performing better than a number of competitive approaches.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace and Expression Recognition · Machine Learning in Bioinformatics · Gene expression and cancer classification