A Kernel-Expanded Stochastic Neural Network

Yan Sun; Faming Liang

arXiv:2201.05319·stat.ML·January 17, 2022

A Kernel-Expanded Stochastic Neural Network

Yan Sun, Faming Liang

PDF

Open Access 1 Repo

TL;DR

The paper introduces K-StoNet, a novel neural network model that integrates support vector regression and kernel methods to overcome local minima issues and facilitate uncertainty quantification, with proven convergence guarantees.

Contribution

It presents a new kernel-expanded stochastic neural network model that reformulates training as convex problems and provides theoretical convergence guarantees.

Findings

01

K-StoNet converges to the global optimum asymptotically.

02

The model effectively quantifies prediction uncertainty.

03

Experimental results demonstrate improved training and prediction performance.

Abstract

The deep neural network suffers from many fundamental issues in machine learning. For example, it often gets trapped into a local minimum in training, and its prediction uncertainty is hard to be assessed. To address these issues, we propose the so-called kernel-expanded stochastic neural network (K-StoNet) model, which incorporates support vector regression (SVR) as the first hidden layer and reformulates the neural network as a latent variable model. The former maps the input vector into an infinite dimensional feature space via a radial basis function (RBF) kernel, ensuring absence of local minima on its training loss surface. The latter breaks the high-dimensional nonconvex neural network training problem into a series of low-dimensional convex optimization problems, and enables its prediction uncertainty easily assessed. The K-StoNet can be easily trained using the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sylydya/a-kernel-expanded-stochastic-neural-network
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Machine Learning and ELM · Advanced Neural Network Applications