Adjoint Differentiation for generic matrix functions
Andrei Goloubentsev, Dmitri Goloubentsev, Evgeny Lakshtanov

TL;DR
This paper develops a general formula for the adjoint of matrix functions, enabling efficient differentiation in applications like spectral decomposition, correlation matrices, and regularized linear regression.
Contribution
It introduces a unified adjoint differentiation formula for matrix functions, including specific cases like spectral decomposition and correlation matrices, simplifying complex derivative computations.
Findings
Derived a general adjoint formula for matrix functions
Provided closed-form expressions for spectral decomposition and correlation matrices
Simplified adjoint computations for regularized linear regression
Abstract
We derive a formula for the adjoint of a square-matrix operation of the form , where is holomorphic in the neighborhood of each eigenvalue. We then apply the formula to derive closed-form expressions in particular cases of interest such as the case when we have a spectral decomposition , the spectrum cut-off and the Nearest Correlation Matrix routine. Finally, we explain how to simplify the computation of adjoints for regularized linear regression coefficients.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMatrix Theory and Algorithms · Numerical methods in inverse problems · Statistical and numerical algorithms
MethodsLinear Regression
