Full-Capacity Unitary Recurrent Neural Networks
Scott Wisdom, Thomas Powers, John R. Hershey, Jonathan Le Roux, and, Les Atlas

TL;DR
This paper introduces full-capacity uRNNs that optimize over all unitary matrices, overcoming limitations of previous parameterizations and significantly improving performance on sequential data tasks.
Contribution
It provides a theoretical analysis of parameterization capacity and develops a method to optimize full-capacity unitary matrices in uRNNs.
Findings
Full-capacity uRNNs outperform restricted-capacity models.
Theoretical capacity analysis reveals limitations of existing parameterizations.
Empirical results show improved performance on synthetic and natural data.
Abstract
Recurrent neural networks are powerful models for processing sequential data, but they are generally plagued by vanishing and exploding gradient problems. Unitary recurrent neural networks (uRNNs), which use unitary recurrence matrices, have recently been proposed as a means to avoid these issues. However, in previous experiments, the recurrence matrices were restricted to be a product of parameterized unitary matrices, and an open question remains: when does such a parameterization fail to represent all unitary matrices, and how does this restricted representational capacity limit what can be learned? To address this question, we propose full-capacity uRNNs that optimize their recurrence matrix over all unitary matrices, leading to significantly improved performance over uRNNs that use a restricted-capacity recurrence matrix. Our contribution consists of two main components. First, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Domain Adaptation and Few-Shot Learning · Blind Source Separation Techniques
