One-Sided Matrix Completion from Ultra-Sparse Samples

Hongyang R. Zhang; Zhenshuo Zhang; Huy L. Nguyen; Guanghui Lan

arXiv:2601.12213·cs.LG·January 21, 2026

One-Sided Matrix Completion from Ultra-Sparse Samples

Hongyang R. Zhang, Zhenshuo Zhang, Huy L. Nguyen, Guanghui Lan

PDF

Open Access

TL;DR

This paper introduces a novel method for estimating the row span and second-moment matrix of large, sparse matrices from ultra-sparse samples, using an unbiased estimator and gradient descent, with theoretical guarantees and practical validation.

Contribution

It develops an unbiased estimator for the second-moment matrix in ultra-sparse sampling regimes and proves its effectiveness under certain conditions, with empirical validation on real-world datasets.

Findings

01

Estimator is unbiased for any sampling probability p

02

Gradient descent recovers the second-moment matrix with low error

03

Method significantly reduces bias and error on real datasets

Abstract

Matrix completion is a classical problem that has received recurring interest across a wide range of fields. In this paper, we revisit this problem in an ultra-sparse sampling regime, where each entry of an unknown, $n \times d$ matrix $M$ (with $n \geq d$ ) is observed independently with probability $p = C / d$ , for a fixed integer $C \geq 2$ . This setting is motivated by applications involving large, sparse panel datasets, where the number of rows far exceeds the number of columns. When each row contains only $C$ entries -- fewer than the rank of $M$ -- accurate imputation of $M$ is impossible. Instead, we estimate the row span of $M$ or the averaged second-moment matrix $T = M^{⊤} M / n$ . The empirical second-moment matrix computed from observed entries exhibits non-random and sparse missingness. We propose an unbiased estimator that normalizes each nonzero entry of the second…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Random Matrices and Applications