Variational Neural Networks: Every Layer and Neuron Can Be Unique

Yiwei Li; Enzhi Li

arXiv:1810.06120·cs.LG·October 16, 2018

Variational Neural Networks: Every Layer and Neuron Can Be Unique

Yiwei Li, Enzhi Li

PDF

Open Access

TL;DR

This paper introduces variational neural networks where each layer and neuron can have a unique activation function, optimized through gradient descent to improve neural network performance.

Contribution

It proposes a novel framework representing activation functions as linear combinations of candidates, with derived gradient formulas for optimization.

Findings

01

Activation functions can be optimized per neuron.

02

Gradient formulas enable efficient training of variational neural networks.

03

Potential for improved neural network performance.

Abstract

The choice of activation function can significantly influence the performance of neural networks. The lack of guiding principles for the selection of activation function is lamentable. We try to address this issue by introducing our variational neural networks, where the activation function is represented as a linear combination of possible candidate functions, and an optimal activation is obtained via minimization of a loss function using gradient descent method. The gradient formulae for the loss function with respect to these expansion coefficients are central for the implementation of gradient descent algorithm, and here we derive these gradient formulae.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications