Distribution-in-distribution-out Regression
Xiaoyu Chen, Mengfan Fu, Yujing Huang, Xinwei Deng

TL;DR
This paper introduces a novel regression model for probability measure inputs and outputs in Wasserstein space, utilizing parallel transport to enable additive operations and improve estimation and prediction.
Contribution
It proposes the distribution-in-distribution-out (DIDO) regression model with parallel transport, establishing a theoretical foundation and practical methods for regression in Wasserstein space.
Findings
The DIDO model achieves provable additivity and commutativity.
The Fréchet least squares estimator provides unbiased estimates.
Application to cardiac output prediction demonstrates practical effectiveness.
Abstract
Regression analysis with probability measures as input predictors and output response has recently drawn great attention. However, it is challenging to handle multiple input probability measures due to the non-flat Riemannian geometry of the Wasserstein space, hindering the definition of arithmetic operations, hence additive linear structure is not well-defined. In this work, a distribution-in-distribution-out regression model is proposed by introducing parallel transport to achieve provable commutativity and additivity of newly defined arithmetic operations in Wasserstein space. The appealing properties of the DIDO regression model can serve a foundation for model estimation, prediction, and inference. Specifically, the Fr\'echet least squares estimator is employed to obtain the best linear unbiased estimate, supported by the newly established Fr\'echet Gauss-Markov Theorem.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models
