Deep Semi-Random Features for Nonlinear Function Approximation

Kenji Kawaguchi; Bo Xie; Vikas Verma; Le Song

arXiv:1702.08882·cs.LG·November 22, 2017·2 cites

Deep Semi-Random Features for Nonlinear Function Approximation

Kenji Kawaguchi, Bo Xie, Vikas Verma, Le Song

PDF

Open Access 1 Repo

TL;DR

This paper introduces semi-random features for nonlinear function approximation, bridging the gap between deep learning and kernel methods, with theoretical guarantees and practical advantages in performance and efficiency.

Contribution

It provides a new semi-random feature approach with proven universality, optimization, and generalization properties, outperforming traditional random features and matching neural networks.

Findings

01

Semi-random features achieve universality and good approximation with increasing width.

02

Deep semi-random models have strong theoretical guarantees including generalization bounds.

03

Experiments show semi-random features match neural networks and outperform random features.

Abstract

We propose semi-random features for nonlinear function approximation. The flexibility of semi-random feature lies between the fully adjustable units in deep learning and the random features used in kernel methods. For one hidden layer models with semi-random features, we prove with no unrealistic assumptions that the model classes contain an arbitrarily good function as the width increases (universality), and despite non-convexity, we can find such a good function (optimization theory) that generalizes to unseen new data (generalization bound). For deep models, with no unrealistic assumptions, we prove universal approximation ability, a lower bound on approximation error, a partial optimization guarantee, and a generalization bound. Depending on the problems, the generalization bound of deep semi-random features can be exponentially better than the known bounds of deep ReLU nets; our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zixu1986/semi-random
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and ELM · Domain Adaptation and Few-Shot Learning · Stochastic Gradient Optimization Techniques

Methods*Communicated@Fast*How Do I Communicate to Expedia?