Cheap Orthogonal Constraints in Neural Networks: A Simple Parametrization of the Orthogonal and Unitary Group
Mario Lezcano-Casado, David Mart\'inez-Rubio

TL;DR
This paper presents a simple, efficient parametrization for orthogonal and unitary constraints in neural networks using Lie group theory, enabling easier optimization and improved RNN performance.
Contribution
It introduces a Lie group-based exponential map parametrization that simplifies optimization under orthogonal constraints in neural networks.
Findings
Faster and more stable convergence in RNN training.
Efficient implementation with negligible runtime overhead.
Improved robustness in orthogonal constrained optimization.
Abstract
We introduce a novel approach to perform first-order optimization with orthogonal and unitary constraints. This approach is based on a parametrization stemming from Lie group theory through the exponential map. The parametrization transforms the constrained optimization problem into an unconstrained one over a Euclidean space, for which common first-order optimization methods can be used. The theoretical results presented are general enough to cover the special orthogonal group, the unitary group and, in general, any connected compact Lie group. We discuss how this and other parametrizations can be computed efficiently through an implementation trick, making numerically complex parametrizations usable at a negligible runtime cost in neural networks. In particular, we apply our results to RNNs with orthogonal recurrent weights, yielding a new architecture called expRNN. We demonstrate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Advanced Neural Network Applications · Neural Networks and Applications
