Wasserstein Regression with Empirical Measures and Density Estimation for Sparse Data
Yidong Zhou, Hans-Georg M\"uller

TL;DR
This paper introduces a novel Wasserstein regression method using empirical measures and density estimation, effectively handling sparse data situations where traditional density estimation fails, and demonstrates superior performance through simulations and real-world case studies.
Contribution
It proposes a new distribution-response regression approach based on empirical measures that works well with sparse data and avoids complex density estimation issues.
Findings
Outperforms existing methods in simulations.
Effective with small sample sizes for distributions.
Validated on environmental and auction datasets.
Abstract
The problem of modeling the relationship between univariate distributions and one or more explanatory variables has found increasing interest. Traditional functional data methods cannot be applied directly to distributional data because of their inherent constraints. Modeling distributions as elements of the Wasserstein space, a geodesic metric space equipped with the Wasserstein metric that is related to optimal transport, is attractive for statistical applications. Existing approaches proceed by substituting proxy estimated distributions for the typically unknown response distributions. These estimates are obtained from available data but are problematic when for some of the distributions only few data are available. Such situations are common in practice and cannot be addressed with available approaches, especially when one aims at density estimates. We show how this and other…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFibromyalgia and Chronic Fatigue Syndrome Research
