On Data-Dependent Random Features for Improved Generalization in   Supervised Learning

Shahin Shahrampour; Ahmad Beirami; Vahid Tarokh

arXiv:1712.07102·stat.ML·December 20, 2017

On Data-Dependent Random Features for Improved Generalization in Supervised Learning

Shahin Shahrampour, Ahmad Beirami, Vahid Tarokh

PDF

TL;DR

This paper introduces EERF, a data-dependent random feature selection method that improves generalization in supervised learning by reducing the number of features needed, with theoretical guarantees and empirical validation.

Contribution

The paper proposes the EERF algorithm, a simple, tuning-free, data-dependent approach that effectively explores feature space to enhance generalization in supervised learning.

Findings

01

EERF requires fewer random features to achieve similar error levels.

02

Theoretical proof shows EERF recovers the spectrum of the best fit.

03

Empirical results demonstrate improved performance on benchmark datasets.

Abstract

The randomized-feature approach has been successfully employed in large-scale kernel approximation and supervised learning. The distribution from which the random features are drawn impacts the number of features required to efficiently perform a learning task. Recently, it has been shown that employing data-dependent randomization improves the performance in terms of the required number of random features. In this paper, we are concerned with the randomized-feature approach in supervised learning for good generalizability. We propose the Energy-based Exploration of Random Features (EERF) algorithm based on a data-dependent score function that explores the set of possible features and exploits the promising regions. We prove that the proposed score function with high probability recovers the spectrum of the best fit within the model class. Our empirical results on several benchmark…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.