Distributional Matrix Completion via Nearest Neighbors in the Wasserstein Space
Jacob Feitelberg, Kyuseong Choi, Anish Agarwal, Raaz Dwivedi

TL;DR
This paper introduces a novel distributional matrix completion method using optimal transport and Wasserstein space, improving distribution estimates and supporting heteroscedasticity, with theoretical and empirical validation.
Contribution
It generalizes matrix completion to distributional data using Wasserstein distances and establishes recovery guarantees under a latent factor model.
Findings
Better distributional estimates than observed samples alone
Accurate estimation of distributional quantities like standard deviation and VaR
Supports heteroscedastic distributions inherently
Abstract
We study the problem of distributional matrix completion: Given a sparsely observed matrix of empirical distributions, we seek to impute the true distributions associated with both observed and unobserved matrix entries. This is a generalization of traditional matrix completion, where the observations per matrix entry are scalar-valued. To do so, we utilize tools from optimal transport to generalize the nearest neighbors method to the distributional setting. Under a suitable latent factor model on probability distributions, we establish that our method recovers the distributions in the Wasserstein metric. We demonstrate through simulations that our method (i) provides better distributional estimates for an entry compared to using observed samples for that entry alone, (ii) yields accurate estimates of distributional quantities such as standard deviation and value-at-risk, and (iii)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRandom Matrices and Applications · Advanced Algebra and Geometry · Mathematical Analysis and Transform Methods
