On the Role of Depth in the Expressivity of RNNs

Maude Lizaire; Michael Rizvi-Martel; \'Eric Dupuis; Guillaume Rabusseau

arXiv:2604.02201·cs.LG·April 3, 2026

On the Role of Depth in the Expressivity of RNNs

Maude Lizaire, Michael Rizvi-Martel, \'Eric Dupuis, Guillaume Rabusseau

PDF

TL;DR

This paper investigates how depth enhances the expressivity and memory capacity of RNNs and 2RNNs, showing that depth enables more complex transformations and better information retention.

Contribution

It provides a formal analysis of depth's role in RNNs and 2RNNs, demonstrating increased expressivity and memory capacity, and highlights the unique benefits of multiplicative interactions.

Findings

01

Depth increases RNNs' memory capacity efficiently.

02

2RNNs perform polynomial transformations with degree growing with depth.

03

Multiplicative interactions cannot be replaced by layerwise nonlinearities.

Abstract

The benefits of depth in feedforward neural networks are well known: composing multiple layers of linear transformations with nonlinear activations enables complex computations. While similar effects are expected in recurrent neural networks (RNNs), it remains unclear how depth interacts with recurrence to shape expressive power. Here, we formally show that depth increases RNNs' memory capacity efficiently with respect to the number of parameters, thus enhancing expressivity both by enabling more complex input transformations and improving the retention of past information. We broaden our analysis to 2RNNs, a generalization of RNNs with multiplicative interactions between inputs and hidden states. Unlike RNNs, which remain linear without nonlinear activations, 2RNNs perform polynomial transformations whose maximal degree grows with depth. We further show that multiplicative interactions…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.