On unsupervised projections and second order signals
Thomas Lartigue, Sach Mukherjee

TL;DR
This paper investigates whether unsupervised linear projections like PCA can preserve second order differences, such as covariance structures, between latent groups in high-dimensional data, with implications for biomedical data analysis.
Contribution
The study provides a theoretical and empirical comparison of PCA and random projections in preserving second order signals in unsupervised high-dimensional data analysis.
Findings
PCA generally outperforms random projections in retaining second order signals.
PCA is often competitive with supervised projections for second order structure.
Projection dimension influences the bias-variance trade-off in signal preservation.
Abstract
Linear projections are widely used in the analysis of high-dimensional data. In unsupervised settings where the data harbour latent classes/clusters, the question of whether class discriminatory signals are retained under projection is crucial. In the case of mean differences between classes, this question has been well studied. However, in many contemporary applications, notably in biomedicine, group differences at the level of covariance or graphical model structure are important. Motivated by such applications, in this paper we ask whether linear projections can preserve differences in second order structure between latent groups. We focus on unsupervised projections, which can be computed without knowledge of class labels. We discuss a simple theoretical framework to study the behaviour of such projections which we use to inform an analysis via quasi-exhaustive enumeration. This…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Blind Source Separation Techniques · Face and Expression Recognition
MethodsPrincipal Components Analysis
