Neural Network based on Automatic Differentiation Transformation of Numeric Iterate-to-Fixedpoint
Mansura Habiba, Barak A. Pearlmutter

TL;DR
This paper introduces a neural network architecture with dynamic depth control via an iterate-to-fixed-point operator, enabling bidirectional information flow and improved handling of long-term dependencies.
Contribution
It presents a novel neural network with 'temporal wormhole' connections that allow information flow up and down, enhancing depth control and addressing vanishing gradients.
Findings
Competitive performance on long-term dependency tasks
Reduced training time for easier inputs
Effective handling of vanishing gradients
Abstract
This work proposes a Neural Network model that can control its depth using an iterate-to-fixed-point operator. The architecture starts with a standard layered Network but with added connections from current later to earlier layers, along with a gate to make them inactive under most circumstances. These ``temporal wormhole'' connections create a shortcut that allows the Neural Network to use the information available at deeper layers and re-do earlier computations with modulated inputs. End-to-end training is accomplished by using appropriate calculations for a numeric iterate-to-fixed-point operator. In a typical case, where the ``wormhole'' connections are inactive, this is inexpensive; but when they are active, the network takes a longer time to settle down, and the gradient calculation is also more laborious, with an effect similar to making the network deeper. In contrast to the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Neural dynamics and brain function · Neural Networks and Reservoir Computing
