PCA++: How Uniformity Induces Robustness to Background Noise in Contrastive Learning

Mingqi Wu; Qiang Sun; Yi Yang

arXiv:2511.12278·stat.ML·November 18, 2025

PCA++: How Uniformity Induces Robustness to Background Noise in Contrastive Learning

Mingqi Wu, Qiang Sun, Yi Yang

PDF

Open Access

TL;DR

PCA++ introduces a uniformity-constrained contrastive PCA method that enhances robustness to background noise in high-dimensional data, outperforming standard PCA in various noisy scenarios.

Contribution

The paper proposes PCA++, a novel contrastive PCA method with a closed-form solution that regularizes against background noise, improving signal recovery in high-dimensional settings.

Findings

01

PCA++ outperforms standard PCA and PCA+ in noisy data scenarios.

02

Uniformity constraints improve robustness to structured background noise.

03

Theoretical analysis confirms stability and regularization benefits of PCA++.

Abstract

High-dimensional data often contain low-dimensional signals obscured by structured background noise, which limits the effectiveness of standard PCA. Motivated by contrastive learning, we address the problem of recovering shared signal subspaces from positive pairs, paired observations sharing the same signal but differing in background. Our baseline, PCA+, uses alignment-only contrastive learning and succeeds when background variation is mild, but fails under strong noise or high-dimensional regimes. To address this, we introduce PCA++, a hard uniformity-constrained contrastive PCA that enforces identity covariance on projected features. PCA++ has a closed-form solution via a generalized eigenproblem, remains stable in high dimensions, and provably regularizes against background interference. We provide exact high-dimensional asymptotics in both fixed-aspect-ratio and growing-spike…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSingle-cell and spatial transcriptomics · Domain Adaptation and Few-Shot Learning · Generative Adversarial Networks and Image Synthesis