PCA recovery thresholds in low-rank matrix inference with sparse noise

Urte Adomaityte; Gabriele Sicuro; Pierpaolo Vivo

arXiv:2511.11927·stat.ML·November 18, 2025

PCA recovery thresholds in low-rank matrix inference with sparse noise

Urte Adomaityte, Gabriele Sicuro, Pierpaolo Vivo

PDF

Open Access

TL;DR

This paper analyzes the detectability of a rank-one signal in high-dimensional matrices corrupted by sparse noise, extending the BBP transition to sparse noise scenarios using replica methods.

Contribution

It introduces a theoretical framework for understanding eigenvalue and eigenvector behavior in sparse noise settings, generalizing the BBP transition.

Findings

01

Identifies the critical signal strength for recovery in sparse noise

02

Derives recursive equations for eigenvector distributions

03

Validates analytical predictions with numerical simulations

Abstract

We study the high-dimensional inference of a rank-one signal corrupted by sparse noise. The noise is modelled as the adjacency matrix of a weighted undirected graph with finite average connectivity in the large size limit. Using the replica method from statistical physics, we analytically compute the typical value of the top eigenvalue, the top eigenvector component density, and the overlap between the signal vector and the top eigenvector. The solution is given in terms of recursive distributional equations for auxiliary probability density functions which can be efficiently solved using a population dynamics algorithm. Specialising the noise matrix to Poissonian and Random Regular degree distributions, the critical signal strength is analytically identified at which a transition happens for the recovery of the signal via the top eigenvector, thus generalising the celebrated BBP…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRandom Matrices and Applications · Sparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques