TL;DR
This paper introduces a Lie algebra-based parametrization for unitary operators in recurrent neural networks, enabling effective gradient-based learning and addressing vanishing/exploding gradient issues.
Contribution
It presents a novel Lie algebra-based parametrization of unitary matrices that simplifies training and improves performance over existing methods.
Findings
Outperforms previous low-dimensional parametrizations.
Successfully learns arbitrary unitary operators.
Effectively solves long-memory tasks with unitary RNNs.
Abstract
A major challenge in the training of recurrent neural networks is the so-called vanishing or exploding gradient problem. The use of a norm-preserving transition operator can address this issue, but parametrization is challenging. In this work we focus on unitary operators and describe a parametrization using the Lie algebra associated with the Lie group of unitary matrices. The exponential map provides a correspondence between these spaces, and allows us to define a unitary matrix using real coefficients relative to a basis of the Lie algebra. The parametrization is closed under additive updates of these coefficients, and thus provides a simple space in which to do gradient descent. We demonstrate the effectiveness of this parametrization on the problem of learning arbitrary unitary operators, comparing to several baselines and outperforming a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
