Auto-Rotating Perceptrons
Daniel Saromo, Elizabeth Villota, Edwin Villanueva

TL;DR
This paper introduces the auto-rotating perceptron (ARP), a novel neuron design that prevents saturation in activation functions, thereby improving training efficiency of deep neural networks with sigmoid activations.
Contribution
The paper presents the ARP, a new perceptron design that maintains neurons in the dynamic region of activation functions, addressing vanishing gradient issues without altering the network inference structure.
Findings
ARP units improve learning performance over classic perceptrons
Networks with ARP units converge faster in experiments
ARP effectively mitigates vanishing gradient problems
Abstract
This paper proposes an improved design of the perceptron unit to mitigate the vanishing gradient problem. This nuisance appears when training deep multilayer perceptron networks with bounded activation functions. The new neuron design, named auto-rotating perceptron (ARP), has a mechanism to ensure that the node always operates in the dynamic region of the activation function, by avoiding saturation of the perceptron. The proposed method does not change the inference structure learned at each neuron. We test the effect of using ARP units in some network architectures which use the sigmoid activation function. The results support our hypothesis that neural networks with ARP units can achieve better learning performance than equivalent models with classic perceptrons.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Machine Learning and ELM · Face and Expression Recognition
MethodsTest · Sigmoid Activation
