Robust Automatic Differentiation of Square-Root Kalman Filters via Gramian Differentials
Adrien Corenflos

TL;DR
This paper introduces a robust method for differentiating square-root Kalman filters by leveraging Gramian differentials, ensuring stable gradients even with rank-deficient matrices, which enhances parameter learning in state-space models.
Contribution
It derives a closed-form chain-rule for differentiating through the Gramian of the matrix, resolving issues with non-uniqueness and divergence in standard methods, and extends it to rank-deficient inputs.
Findings
Exact gradient computation for Kalman log-marginal likelihood
Stable differentiation through Gramian-based methods
Extension to rank-deficient matrices using pseudoinverse and null-space correction
Abstract
Square-root Kalman filters propagate state covariances in Cholesky-factor form for numerical stability, and are a natural target for gradient-based parameter learning in state-space models. Their core operation, triangularization of a matrix , is computed via a QR decomposition in practice, but naively differentiating through it causes two problems: the semi-orthogonal factor is non-unique when , yielding undefined gradients; and the standard Jacobian formula involves inverses, which diverges when is rank-deficient. Both are resolved by the observation that all filter outputs relevant to learning depend on the input matrix only through the Gramian , so the composite loss is smooth in even where the triangularization is not. We derive a closed-form chain-rule directly from the differential of this Gramian identity, prove it exact for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Target Tracking and Data Fusion in Sensor Networks · Matrix Theory and Algorithms
