The hierarchical barycenter: conditional probability simulation with structured and unobserved covariates
Esteban G. Tabak, Giulio Trigila, Wenjun Zhao

TL;DR
This paper introduces a hierarchical barycenter method for simulating conditional probability densities in unstructured datasets with missing or partially overlapping covariates, extending optimal transport theory.
Contribution
It develops a novel hierarchical barycenter approach based on optimal transport, with a new numerical procedure, for better handling unstructured and incomplete data in probability simulation.
Findings
The method effectively handles datasets with missing covariates.
It improves simulation accuracy over classical barycenter methods.
Validated on synthetic and real-world data.
Abstract
This paper presents a new method for conditional probability density simulation. The method is design to work with unstructured data set when data are not characterized by the same covariates yet share common information. Specific examples considered in the text are relative to two main classes: homogeneous data characterized by samples with missing value for the covariates and data set divided in two or more groups characterized by covariates that are only partially overlapping. The methodology is based on the mathematical theory of optimal transport extending the barycenter problem to the newly defined hierarchical barycenter problem. A newly, data driven, numerical procedure for the solution of the hierarchical barycenter problem is proposed and its advantages, over the use of classical barycenter, are illustrated on synthetic and real world data sets.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMarkov Chains and Monte Carlo Methods · Bayesian Modeling and Causal Inference · Statistical Methods and Bayesian Inference
