One-Sided Matrix Completion from Ultra-Sparse Samples
Hongyang R. Zhang, Zhenshuo Zhang, Huy L. Nguyen, Guanghui Lan

TL;DR
This paper introduces a novel method for estimating the row span and second-moment matrix of large, sparse matrices from ultra-sparse samples, using an unbiased estimator and gradient descent, with theoretical guarantees and practical validation.
Contribution
It develops an unbiased estimator for the second-moment matrix in ultra-sparse sampling regimes and proves its effectiveness under certain conditions, with empirical validation on real-world datasets.
Findings
Estimator is unbiased for any sampling probability p
Gradient descent recovers the second-moment matrix with low error
Method significantly reduces bias and error on real datasets
Abstract
Matrix completion is a classical problem that has received recurring interest across a wide range of fields. In this paper, we revisit this problem in an ultra-sparse sampling regime, where each entry of an unknown, matrix (with ) is observed independently with probability , for a fixed integer . This setting is motivated by applications involving large, sparse panel datasets, where the number of rows far exceeds the number of columns. When each row contains only entries -- fewer than the rank of -- accurate imputation of is impossible. Instead, we estimate the row span of or the averaged second-moment matrix . The empirical second-moment matrix computed from observed entries exhibits non-random and sparse missingness. We propose an unbiased estimator that normalizes each nonzero entry of the second…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Random Matrices and Applications
