Computational and Statistical Boundaries for Submatrix Localization in a Large Noisy Matrix
T. Tony Cai, Tengyuan Liang, Alexander Rakhlin

TL;DR
This paper investigates the limits of efficiently detecting a signal submatrix in a noisy large matrix, revealing a gap between what is statistically possible and what is computationally feasible.
Contribution
It establishes computational and statistical thresholds for submatrix localization and introduces efficient algorithms that operate near the computational boundary.
Findings
Identifies a computational threshold SNR_c for polynomial-time success.
Defines a statistical threshold SNR_s below which detection is impossible.
Shows a significant gap between computational and statistical boundaries.
Abstract
The interplay between computational efficiency and statistical accuracy in high-dimensional inference has drawn increasing attention in the literature. In this paper, we study computational and statistical boundaries for submatrix localization. Given one observation of (one or multiple non-overlapping) signal submatrix (of magnitude and size ) contaminated with a noise matrix (of size ), we establish two transition thresholds for the signal to noise ratio in terms of , , , and . The first threshold, , corresponds to the computational boundary. Below this threshold, it is shown that no polynomial time algorithm can succeed in identifying the submatrix, under the \textit{hidden clique hypothesis}. We introduce adaptive linear time spectral algorithms that identify the submatrix with high probability when the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
