Parallelized Computation and Backpropagation Under Angle-Parametrized Orthogonal Matrices
Firas Hamze

TL;DR
This paper introduces a parallelized method for efficient computation and backpropagation of orthogonal matrices in machine learning, leveraging graph coloring techniques for faster algorithms.
Contribution
It presents a novel approach to restructure orthogonal matrix parametrization into parallelizable blocks, enabling faster computation and gradient backpropagation.
Findings
O(n) algorithm for matrix computation
O(n log n) gradient computation
Promising GPU performance results
Abstract
We present a methodology for parallel acceleration of learning in the presence of matrix orthogonality and unitarity constraints of interest in several branches of machine learning. We show how an apparently sequential elementary rotation parametrization can be restructured into blocks of commutative operations using a well-known tool for coloring the edges of complete graphs, in turn widely applied to schedule round-robin (all-against-all) sports tournaments. The resulting decomposition admits an algorithm to compute a fully-parametrized orthogonal matrix from its rotation parameters in sequential steps and one to compute the gradient of a training loss with respect to its parameters in steps. We discuss parametric restrictions of interest to generative modeling and present promising performance results with a prototype GPU implementation.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Reservoir Computing · Neural Networks and Applications · Model Reduction and Neural Networks
