Tensors, Learning, and 'Kolmogorov Extension' for Finite-alphabet Random Vectors
Nikos Kargas, Nicholas D. Sidiropoulos, Xiao Fu

TL;DR
This paper introduces a non-parametric method for estimating the joint probability mass function of multiple variables using low-dimensional distributions, leveraging multilinear algebra and inspired by Kolmogorov's extension theorem.
Contribution
It demonstrates that the joint PMF of all variables can be recovered from only three-dimensional distributions under mild conditions, with practical algorithms and experimental validation.
Findings
Joint PMF can be recovered from 3D distributions if the process has limited complexity.
The method requires fewer distributions than traditional approaches, sometimes only 2D.
Experimental results show effectiveness on real data like movie recommendations.
Abstract
Estimating the joint probability mass function (PMF) of a set of random variables lies at the heart of statistical learning and signal processing. Without structural assumptions, such as modeling the variables as a Markov chain, tree, or other graphical model, joint PMF estimation is often considered mission impossible - the number of unknowns grows exponentially with the number of variables. But who gives us the structural model? Is there a generic, `non-parametric' way to control joint PMF complexity without relying on a priori structural assumptions regarding the underlying probability model? Is it possible to discover the operational structure without biasing the analysis up front? What if we only observe random subsets of the variables, can we still reliably estimate the joint PMF of all? This paper shows, perhaps surprisingly, that if the joint PMF of any three variables can be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
