Learning Multivariate CDFs and Copulas using Tensor Factorization
Magda Amiridi, Nicholas D. Sidiropoulos

TL;DR
This paper introduces a tensor factorization approach to learn multivariate CDFs and copulas, providing a scalable, identifiable, and efficient alternative to traditional density-based methods, with applications in regression, sampling, and imputation.
Contribution
It proposes a novel low-rank tensor decomposition method for multivariate CDFs and copulas, overcoming limitations of existing density-focused approaches and offering theoretical guarantees.
Findings
Superior performance on synthetic and real datasets
Efficient sampling and inference capabilities
Better density estimates via CDF models than PDF models
Abstract
Learning the multivariate distribution of data is a core challenge in statistics and machine learning. Traditional methods aim for the probability density function (PDF) and are limited by the curse of dimensionality. Modern neural methods are mostly based on black-box models, lacking identifiability guarantees. In this work, we aim to learn multivariate cumulative distribution functions (CDFs), as they can handle mixed random variables, allow efficient box probability evaluation, and have the potential to overcome local sample scarcity owing to their cumulative nature. We show that any grid sampled version of a joint CDF of mixed random variables admits a universal representation as a naive Bayes model via the Canonical Polyadic (tensor-rank) decomposition. By introducing a low-rank model, either directly in the raw data domain, or indirectly in a transformed (Copula) domain, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTensor decomposition and applications · Computational Physics and Python Applications · Advanced Neural Network Applications
