Ensemble Neural Networks (ENN): A gradient-free stochastic method

Yuntian Chen; Haibin Chang; Meng Jin; Dongxiao Zhang

arXiv:1908.01113·stat.ML·November 11, 2019

Ensemble Neural Networks (ENN): A gradient-free stochastic method

Yuntian Chen, Haibin Chang, Meng Jin, Dongxiao Zhang

PDF

TL;DR

The paper introduces Ensemble Neural Networks (ENN), a gradient-free stochastic method based on covariance matrices and Bayesian principles, enhancing robustness, uncertainty quantification, and applicability to complex models.

Contribution

It develops ENN, a novel gradient-free neural network method using covariance matrices and EnRML, enabling uncertainty quantification and broad compatibility with existing neural network architectures.

Findings

01

ENN outperforms traditional Bayesian neural networks.

02

ENN is robust with small training datasets.

03

ENN can be integrated with various neural network models.

Abstract

In this study, an efficient stochastic gradient-free method, the ensemble neural networks (ENN), is developed. In the ENN, the optimization process relies on covariance matrices rather than derivatives. The covariance matrices are calculated by the ensemble randomized maximum likelihood algorithm (EnRML), which is an inverse modeling method. The ENN is able to simultaneously provide estimations and perform uncertainty quantification since it is built under the Bayesian framework. The ENN is also robust to small training data size because the ensemble of stochastic realizations essentially enlarges the training dataset. This constitutes a desirable characteristic, especially for real-world engineering applications. In addition, the ENN does not require the calculation of gradients, which enables the use of complicated neuron models and loss functions in neural networks. We experimentally…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.