Deep Semi-Random Features for Nonlinear Function Approximation
Kenji Kawaguchi, Bo Xie, Vikas Verma, Le Song

TL;DR
This paper introduces semi-random features for nonlinear function approximation, bridging the gap between deep learning and kernel methods, with theoretical guarantees and practical advantages in performance and efficiency.
Contribution
It provides a new semi-random feature approach with proven universality, optimization, and generalization properties, outperforming traditional random features and matching neural networks.
Findings
Semi-random features achieve universality and good approximation with increasing width.
Deep semi-random models have strong theoretical guarantees including generalization bounds.
Experiments show semi-random features match neural networks and outperform random features.
Abstract
We propose semi-random features for nonlinear function approximation. The flexibility of semi-random feature lies between the fully adjustable units in deep learning and the random features used in kernel methods. For one hidden layer models with semi-random features, we prove with no unrealistic assumptions that the model classes contain an arbitrarily good function as the width increases (universality), and despite non-convexity, we can find such a good function (optimization theory) that generalizes to unseen new data (generalization bound). For deep models, with no unrealistic assumptions, we prove universal approximation ability, a lower bound on approximation error, a partial optimization guarantee, and a generalization bound. Depending on the problems, the generalization bound of deep semi-random features can be exponentially better than the known bounds of deep ReLU nets; our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and ELM · Domain Adaptation and Few-Shot Learning · Stochastic Gradient Optimization Techniques
Methods*Communicated@Fast*How Do I Communicate to Expedia?
