# Feed-forward approximations to dynamic recurrent network architectures

**Authors:** Dylan Richard Muir

arXiv: 1704.06645 · 2019-11-18

## TL;DR

This paper proposes a method to approximate the responses of recurrent neural networks using two-layer feedforward networks, enabling faster and deterministic computation of complex dynamic responses.

## Contribution

The authors introduce a novel approach to approximate recurrent network responses with simple feedforward architectures by training on fixed-point responses, improving computational efficiency.

## Key findings

- Feedforward networks can approximate recurrent network responses.
- Approximations enable linear and deterministic evaluation.
- The method captures complex dynamics like competition and noise rejection.

## Abstract

Recurrent neural network architectures can have useful computational properties, with complex temporal dynamics and input-sensitive attractor states. However, evaluation of recurrent dynamic architectures requires solution of systems of differential equations, and the number of evaluations required to determine their response to a given input can vary with the input, or can be indeterminate altogether in the case of oscillations or instability. In feed-forward networks, by contrast, only a single pass through the network is needed to determine the response to a given input. Modern machine-learning systems are designed to operate efficiently on feed-forward architectures. We hypothesised that two-layer feedforward architectures with simple, deterministic dynamics could approximate the responses of single-layer recurrent network architectures. By identifying the fixed-point responses of a given recurrent network, we trained two-layer networks to directly approximate the fixed-point response to a given input. These feed-forward networks then embodied useful computations, including competitive interactions, information transformations and noise rejection. Our approach was able to find useful approximations to recurrent networks, which can then be evaluated in linear and deterministic time complexity.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1704.06645/full.md

## Figures

12 figures with captions in the complete paper: https://tomesphere.com/paper/1704.06645/full.md

## References

43 references — full list in the complete paper: https://tomesphere.com/paper/1704.06645/full.md

---
Source: https://tomesphere.com/paper/1704.06645