Distributed Learning for Principle Eigenspaces without Moment Constraints
Yong He, Zichen Liu, Yalin Wang

TL;DR
This paper introduces a distributed PCA algorithm that estimates principal eigenspaces without moment constraints, effective for heavy-tailed data, and demonstrates its theoretical properties and practical advantages over existing methods.
Contribution
It proposes a novel distributed eigenspace estimation method using Kendall's tau matrix under elliptical distributions, removing moment constraints and improving robustness.
Findings
Performs comparably with existing methods on light-tailed data.
Outperforms in heavy-tailed data scenarios.
Extends to elliptical factor models with real data application.
Abstract
Distributed Principal Component Analysis (PCA) has been studied to deal with the case when data are stored across multiple machines and communication cost or privacy concerns prohibit the computation of PCA in a central location. However, the sub-Gaussian assumption in the related literature is restrictive in real application where outliers or heavy-tailed data are common in areas such as finance and macroeconomic. In this article, we propose a distributed algorithm for estimating the principle eigenspaces without any moment constraint on the underlying distribution. We study the problem under the elliptical family framework and adopt the sample multivariate Kendall'tau matrix to extract eigenspace estimators from all sub-machines, which can be viewed as points in the Grassman manifold. We then find the "center" of these points as the final distributed estimator of the principal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRandom Matrices and Applications · Statistical Methods and Inference · Blind Source Separation Techniques
