Neural Network based on Automatic Differentiation Transformation of   Numeric Iterate-to-Fixedpoint

Mansura Habiba; Barak A. Pearlmutter

arXiv:2111.00326·cs.LG·November 2, 2021

Neural Network based on Automatic Differentiation Transformation of Numeric Iterate-to-Fixedpoint

Mansura Habiba, Barak A. Pearlmutter

PDF

Open Access

TL;DR

This paper introduces a neural network architecture with dynamic depth control via an iterate-to-fixed-point operator, enabling bidirectional information flow and improved handling of long-term dependencies.

Contribution

It presents a novel neural network with 'temporal wormhole' connections that allow information flow up and down, enhancing depth control and addressing vanishing gradients.

Findings

01

Competitive performance on long-term dependency tasks

02

Reduced training time for easier inputs

03

Effective handling of vanishing gradients

Abstract

This work proposes a Neural Network model that can control its depth using an iterate-to-fixed-point operator. The architecture starts with a standard layered Network but with added connections from current later to earlier layers, along with a gate to make them inactive under most circumstances. These ``temporal wormhole'' connections create a shortcut that allows the Neural Network to use the information available at deeper layers and re-do earlier computations with modulated inputs. End-to-end training is accomplished by using appropriate calculations for a numeric iterate-to-fixed-point operator. In a typical case, where the ``wormhole'' connections are inactive, this is inexpensive; but when they are active, the network takes a longer time to settle down, and the gradient calculation is also more laborious, with an effect similar to making the network deeper. In contrast to the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Neural dynamics and brain function · Neural Networks and Reservoir Computing