A Technical Note on the Architectural Effects on Maximum Dependency Lengths of Recurrent Neural Networks

Jonathan S. Kent; Michael M. Murray

arXiv:2408.11946 · cs.NE · August 23, 2024

Open Access

TL;DR

This paper introduces a methodology to measure the maximum dependency length in RNNs and examines how architectural modifications influence this length across different RNN variants.

Contribution

It provides a systematic approach to quantify dependency lengths and analyzes the impact of architectural choices on RNN memory capabilities.

Findings

01. Architectural changes significantly affect maximum dependency lengths.

02. Gated units like GRU and LSTM show different dependency behaviors than traditional RNNs.

03. Increasing layers and neurons can extend the dependency length capacity.

Abstract

This work proposes a methodology for determining the maximum dependency length of a recurrent neural network (RNN), and then studies how architectural changes, including the number of layers and the number of neurons per layer, affect the maximum dependency lengths of traditional RNN, gated recurrent unit (GRU), and long short-term memory (LSTM) models.
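This page does not describe the paper's measurement procedure, so the following is only an illustrative sketch of what "determining a maximum dependency length" could look like in code, not the authors' method. It assumes a hypothetical predicate `can_recall(gap)` that reports whether a trained model still solves a memory task when the relevant input lies `gap` time steps in the past, and it assumes that this predicate is monotone (true up to some gap, false beyond it). Both the predicate and the `upper_limit` parameter are inventions for this example.

```python
from typing import Callable


def max_dependency_length(can_recall: Callable[[int], bool],
                          upper_limit: int = 4096) -> int:
    """Largest gap in [1, upper_limit] for which can_recall(gap) is True.

    Returns 0 if even a gap of 1 fails. Assumes (hypothetically) that
    can_recall is monotone: True for every gap up to some threshold and
    False for every larger gap. upper_limit must be at least 1.
    """
    if not can_recall(1):
        return 0

    passing = 1   # largest gap known to pass
    failing = 2   # candidate for the smallest gap known to fail

    # Doubling phase: grow the gap until it fails or exceeds the limit.
    while failing <= upper_limit and can_recall(failing):
        passing = failing
        failing *= 2

    # Treat upper_limit + 1 as a failing bound so the search stays in range.
    failing = min(failing, upper_limit + 1)

    # Binary search between the last passing and first failing gap.
    while failing - passing > 1:
        mid = (passing + failing) // 2
        if can_recall(mid):
            passing = mid
        else:
            failing = mid
    return passing


# Toy stand-in for a trained model: pretend it recalls inputs up to 37 steps back.
print(max_dependency_length(lambda gap: gap <= 37))  # prints 37
```

In a real experiment, `can_recall` would wrap training or evaluating an RNN, GRU, or LSTM of a given depth and width on sequences with the chosen gap, and the search would be repeated for each architecture being compared. The doubling-then-bisection search is chosen here only because it keeps the number of expensive evaluations logarithmic in the answer.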

Taxonomy

Topics: Neural Networks and Applications