An Implicit Form of Krasulina's k-PCA Update without the Orthonormality Constraint
Ehsan Amid, Manfred K. Warmuth

TL;DR
This paper introduces an implicit Krasulina's k-PCA update that bypasses the orthonormality constraint, leading to faster convergence, improved stability, and suitability for distributed and parallel implementations.
Contribution
It derives a novel implicit Krasulina's update that avoids QR-decomposition, connects it to an online EM step, and demonstrates its advantages over traditional methods.
Findings
Faster convergence compared to existing updates
More stable with respect to learning rate tuning
Effective in distributed and parallel settings
Abstract
We shed new insights on the two commonly used updates for the online -PCA problem, namely, Krasulina's and Oja's updates. We show that Krasulina's update corresponds to a projected gradient descent step on the Stiefel manifold of the orthonormal -frames, while Oja's update amounts to a gradient descent step using the unprojected gradient. Following these observations, we derive a more \emph{implicit} form of Krasulina's -PCA update, i.e. a version that uses the information of the future gradient as much as possible. Most interestingly, our implicit Krasulina update avoids the costly QR-decomposition step by bypassing the orthonormality constraint. We show that the new update in fact corresponds to an online EM step applied to a probabilistic -PCA model. The probabilistic view of the updates allows us to combine multiple models in a distributed setting. We show experimentally…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
