Fast Approximation of Rotations and Hessians matrices
Michael Mathieu, Yann LeCun

TL;DR
This paper introduces a fast, learnable method for approximating rotation matrices and Hessians with linearithmic complexity, enabling efficient covariance and inverse Hessian estimation in machine learning tasks.
Contribution
The paper presents a novel FFT-like approach to approximate rotation matrices and Hessians using gradient descent, improving computational efficiency in high-dimensional settings.
Findings
Effective approximation of synthetic matrices
Accurate covariance matrix estimation for real data
Reliable inverse Hessian tracking in ML optimization
Abstract
A new method to represent and approximate rotation matrices is introduced. The method represents approximations of a rotation matrix with linearithmic complexity, i.e. with rotations over pairs of coordinates, arranged in an FFT-like fashion. The approximation is "learned" using gradient descent. It allows to represent symmetric matrices as where is a diagonal matrix. It can be used to approximate covariance matrix of Gaussian models in order to speed up inference, or to estimate and track the inverse Hessian of an objective function by relating changes in parameters to changes in gradient along the trajectory followed by the optimization procedure. Experiments were conducted to approximate synthetic matrices, covariance matrices of real data, and Hessian matrices of objective functions involved in machine learning problems.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMatrix Theory and Algorithms · Neural Networks and Applications · Scientific Research and Discoveries
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
