Parallelized Computation and Backpropagation Under Angle-Parametrized Orthogonal Matrices

Firas Hamze

arXiv:2106.00003·cs.LG·June 2, 2021


Open Access

TL;DR

This paper introduces a parallelized method for computing angle-parametrized orthogonal matrices and backpropagating through them in machine learning. It uses edge coloring of complete graphs to group elementary rotations into blocks of commuting operations.

Contribution

The paper restructures an apparently sequential elementary-rotation parametrization of orthogonal matrices into blocks of commuting operations that can run in parallel, which speeds up both the matrix computation and gradient backpropagation.

Findings

01

Computes a fully parametrized orthogonal matrix from its rotation parameters in O(n) sequential steps

02

Computes the gradient of a training loss with respect to the parameters in O(n log n) steps

03

Promising GPU performance results

Abstract

We present a methodology for parallel acceleration of learning in the presence of matrix orthogonality and unitarity constraints of interest in several branches of machine learning. We show how an apparently sequential elementary rotation parametrization can be restructured into blocks of commutative operations using a well-known tool for coloring the edges of complete graphs, in turn widely applied to schedule round-robin (all-against-all) sports tournaments. The resulting decomposition admits an algorithm to compute a fully-parametrized orthogonal matrix from its rotation parameters in $O(n)$ sequential steps and one to compute the gradient of a training loss with respect to its parameters in $O(n \log n)$ steps. We discuss parametric restrictions of interest to generative modeling and present promising performance results with a prototype GPU implementation.
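The round-robin idea in the abstract can be illustrated with a minimal NumPy sketch. It is not the paper's implementation: the function names (`round_robin_rounds`, `apply_round`, `orthogonal_from_angles`), the rotation sign convention, the ordering of rounds, and the restriction to even n are all assumptions made for illustration. The sketch uses the standard "circle method" for scheduling a round-robin tournament, which splits the n(n-1)/2 index pairs into n-1 rounds of n/2 disjoint pairs. Rotations within a round touch disjoint rows, so they commute and can be applied together, giving n-1 sequential steps.

```python
import numpy as np

def round_robin_rounds(n):
    """Circle method: split all pairs (i, j), i < j, of n items (n even)
    into n - 1 rounds, each holding n // 2 mutually disjoint pairs."""
    if n % 2 != 0:
        raise ValueError("this sketch assumes an even n")
    rounds = []
    others = list(range(1, n))  # item 0 stays fixed, the rest rotate
    for _ in range(n - 1):
        ring = [0] + others
        pairs = [tuple(sorted((ring[k], ring[n - 1 - k]))) for k in range(n // 2)]
        rounds.append(pairs)
        others = others[-1:] + others[:-1]  # rotate by one position
    return rounds

def apply_round(U, pairs, thetas):
    """Apply one round of plane (Givens) rotations to the rows of U.
    The pairs are disjoint, so all rotations are applied in one vectorized step."""
    i = np.array([p[0] for p in pairs])
    j = np.array([p[1] for p in pairs])
    c = np.cos(thetas)[:, None]
    s = np.sin(thetas)[:, None]
    Ui, Uj = U[i].copy(), U[j].copy()
    U[i] = c * Ui - s * Uj  # assumed sign convention
    U[j] = s * Ui + c * Uj
    return U

def orthogonal_from_angles(n, angles):
    """Build an n x n orthogonal matrix from angles of shape (n - 1, n // 2),
    one row of angles per round (a hypothetical parameter layout)."""
    U = np.eye(n)
    for pairs, thetas in zip(round_robin_rounds(n), angles):
        U = apply_round(U, pairs, thetas)
    return U

rng = np.random.default_rng(0)
n = 8
angles = rng.uniform(-np.pi, np.pi, size=(n - 1, n // 2))
U = orthogonal_from_angles(n, angles)
print(np.allclose(U @ U.T, np.eye(n)))
```

On a GPU, each call to `apply_round` would correspond to one parallel step. The gradient algorithm with $O(n \log n)$ steps mentioned in the abstract is not reproduced here.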

Taxonomy

Topics: Neural Networks and Reservoir Computing · Neural Networks and Applications · Model Reduction and Neural Networks