Auto-Rotating Perceptrons

Daniel Saromo; Elizabeth Villota; Edwin Villanueva

arXiv:1910.02483·cs.LG·October 9, 2019

TL;DR

This paper introduces the auto-rotating perceptron (ARP), a neuron design that keeps each unit out of the saturated region of a bounded activation function, which improves the training of deep neural networks with sigmoid activations.

Contribution

The paper presents the ARP, a new perceptron design that maintains neurons in the dynamic region of activation functions, addressing vanishing gradient issues without altering the network inference structure.
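As background for the terms "dynamic region" and "vanishing gradient", the standard sigmoid identities can be written out as below. This is a textbook derivation, not taken from the paper; the scalar chain of neurons $a_k = \sigma(w_k a_{k-1} + b_k)$ and the symbols $w_k$, $b_k$, $z_k$ are assumptions introduced only for illustration.

```latex
\sigma(z) = \frac{1}{1 + e^{-z}}, \qquad
\sigma'(z) = \sigma(z)\,\bigl(1 - \sigma(z)\bigr) \le \frac{1}{4}, \qquad
\sigma'(z) \to 0 \ \text{as}\ |z| \to \infty .

% Illustrative scalar chain: z_k = w_k a_{k-1} + b_k, \; a_k = \sigma(z_k)
\frac{\partial a_n}{\partial a_0} = \prod_{k=1}^{n} w_k \, \sigma'(z_k)
```

Each saturated unit (large $|z_k|$) contributes a factor $\sigma'(z_k)$ close to zero, so the product shrinks quickly with depth. Keeping $|z_k|$ small keeps every factor near its maximum of $1/4$.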

Findings

01 ARP units improve learning performance over classic perceptrons
02 Networks with ARP units converge faster in experiments
03 ARP effectively mitigates vanishing gradient problems

Abstract

This paper proposes an improved design of the perceptron unit to mitigate the vanishing gradient problem. This nuisance appears when training deep multilayer perceptron networks with bounded activation functions. The new neuron design, named auto-rotating perceptron (ARP), has a mechanism to ensure that the node always operates in the dynamic region of the activation function, by avoiding saturation of the perceptron. The proposed method does not change the inference structure learned at each neuron. We test the effect of using ARP units in some network architectures which use the sigmoid activation function. The results support our hypothesis that neural networks with ARP units can achieve better learning performance than equivalent models with classic perceptrons.
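The saturation effect described in the abstract can be illustrated with a small NumPy sketch. It is not the authors' implementation: the helper `rescale_preactivation`, the bound `limit=4.0`, and the batch-based scaling factor `rho` are hypothetical choices made for this example, and the paper's actual coefficient is not reproduced here. The sketch only shows that multiplying the pre-activation by a positive scalar keeps the sigmoid out of its flat tails while leaving the sign of `w·x + b`, and therefore the neuron's decision boundary, unchanged.

```python
import numpy as np


def sigmoid(z):
    """Logistic sigmoid."""
    return 1.0 / (1.0 + np.exp(-z))


def sigmoid_grad(z):
    """Derivative of the sigmoid; at most 0.25, near zero for large |z|."""
    s = sigmoid(z)
    return s * (1.0 - s)


def rescale_preactivation(z, limit):
    """Hypothetical helper (an assumption, not the paper's formula):
    scale z by one positive factor so that max |z| does not exceed `limit`."""
    peak = np.max(np.abs(z))
    rho = 1.0 if peak <= limit else limit / peak
    return rho * z, rho


# Synthetic data: bounded inputs and deliberately large weights,
# so that the classic perceptron saturates.
rng = np.random.default_rng(0)
x = rng.uniform(-1.0, 1.0, size=(256, 8))
w = rng.normal(scale=10.0, size=8)
b = 0.5

z = x @ w + b                                   # classic pre-activation
z_scaled, rho = rescale_preactivation(z, limit=4.0)

print("mean gradient, unscaled:", sigmoid_grad(z).mean())
print("mean gradient, rescaled:", sigmoid_grad(z_scaled).mean())
print("signs preserved:", np.array_equal(np.sign(z), np.sign(z_scaled)))
```

Because `rho` is positive, every input falls on the same side of the hyperplane before and after rescaling, which is consistent with the abstract's statement that the inference structure learned at each neuron does not change.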

Code & Models

Repositories

DanielSaromo/ARP (TensorFlow)

Taxonomy

Topics: Neural Networks and Applications · Machine Learning and ELM · Face and Expression Recognition

Methods: Test · Sigmoid Activation